STREPTOCOCCUS PNEUMONIAE PROTEINS AND NUCLEIC ACID MOLECULES 

The present invention relates to proteins derived from Streptococcus pneumoniae, 
nucleic acid niolecules encoding such proteins, the use of the nucleic acid and/or 
proteins as antigens/immunogens and in detection/diagnosis, as well as methods for 
screening the proteins/nucleic acid sequences as potential anti-microbial targets. 

^treptococcus pneumoniae, comnionly referred to as the pneumococcus, is arf 
important pathogenic organism. The continuing significance of Strept0^ocus 
pneumoniae infections in relation to human disease in developing ana developed 
countries has been authoritatively reviewed (Fiber, G.R., Scienp/^265: 1385-1387 
(1994)). That indicates that on a global scale this organisfn is believed to be the 
most common bacterial cause of acute respiratory infections, and is estimated to 
result in 1 million childhood deaths each ye^r; mostly in developing countries 
(Stansfield, S.K., Pediatr, Infect, Dis., 6y^2 (1987)). In the USA it has been 
suggested (Breunan et al. Arch, Jmem, Med,, 150: 1401 (1990)) that the 
pneumococcus is still the most common cause of bacterial pneumonia, and that 
disease rates are panicularlv^gh in young children, in the elderly, and in patients 
with (predisposing condjjlons such as asplenia, heart, lung and kidney disease, 
diabetes, alcoholism^or with immunosupressive disorders, especially AIDS. These 
groups are at higKer risk of pneumococcal septicaemia and hence meningitis and 
therefore have/ a greater risk of dying from pneumococcal infection. The 
pneumococcus is also the leading cause of otitis media and sinusitis, which remain 
prevalent/infections in children in developed countries, and which incur substantial 
costs. / 

-The need for effective preventative strategies against pneumococcal infection is 
highlighted by the recent emergence of penicillin-resistant pneumococci. It has been 
reponed that 6.6% of pneumoccal isolates in 13 US hospitals in 12 states were found 



to be resistant to penicillin and some isolates were also resistant to other antibiotics 
including third generation cyclosporins (Schappert, S.M., Vital and Health Statistics 
of the Centres for Disease Control/National Centre for Health Statistics, 214:1 
(1992)). The rates of penicillin resistance can be higher (up to 20%) in some 
5 hospitals (Breiman et al, J. Am, Med. Assoc., 271: 1831 (1994)). Since the 
development of penicillin resistance among pneumococci is both recent and sudden, 
coming after decades during which penicillin remained an effective treatment, these 
findings are regarded as alarming. 



10 For the reasons given above, there are therefore compelling grounds for considering 
improvements in the means of preventing, controlling, diagnosing or treating 
pneumococcal diseases. 

Various approaches have been taken in order to provide vaccines for the prevention 
15 of pneumococcal infections. Difficulties arise for instance in view of the variety of 
serotypes (at least 90) based on the strucmre of the polysaccharide capsule 
surrounding the organism. Vaccines against individual serotypes are not effective 
against other serotypes and this means that vaccines must include polysaccharide 
antigens from a whole range of serotypes in order to be effective in a majority of 
20 cases. An additional problem arises because it ahs been found that the capsular 

polysaccharides (each of which determines the serotype and is the major protective 
antigen) when purified and used as a vaccine do not reliably induce protective 
antibody responses in children under two years of age, the age group which suffers 
the highest incidence of invasive pneumococcal infection and meningitis. 



25 



A modification of the approach using capsule antigens relies on conjugating the 
polysaccharide to a protein in order to derive an enhanced immune response, 
particularly by giving the response T-cell dependent character. This approach has 



been used in the development of a vaccine against Haemophilus influenzae. There 
are issues of cost concerning both the multi-poly saccharide vaccines and those based 
on conjugates. 

A third approach is to look for other antigenic components which offer the potential 
to be vaccine candidates. In the present application we provide a group of proteins 
antigens which are secreted/exported proteins. 

Thus, in a first aspect the present invention provides a Streptococcus pneumoniae 
protein or polypeptide having a sequence selected from those shown in table 2 
herein, , 

A protein or polypeptide of the present invention may be provided in substantially pure 
form. For example, it may be provided in a form which is substantially free of other 
proteins. 

In a preferred embodiment, a protein or polypeptide having an amino acid sequence as 
shown in Table 3 is provided. 

The invention encompasses any protein coded for by a nucleic acid sequence as shown 
in Table 1 herein. 

As discussed herein, the proteins and polypeptides of the invention are useful as 
antigenic material. Such material can be "antigenic" and/or ** immunogenic** . 
Generally, "antigenic" is taken to mean that the protein or polypeptide is capable of 
being used to raise antibodies or indeed is capable of inducing an antibody response in 
a subject. "Immunogenic*' is taken to mean that the protein or polypeptide is capable of 



elicidag a protective immune response in a subject. Thus, in the latter case, the protein 
or polypeptide may be capable of not only generating an antibody response and in 
addition non-antibody based immune responses. 

5 



The skilled person will appreciate that homologues or derivatives of the proteins or 
10 polypeptides of the invention will also fmd use in the context of the present invention, 
ie as antigenic/ immunogenic material. Thus, for instance proteins or polypeptides 
which ,include one or more additions, deletions, substitutions or the like are 
encompassed by the present invention. In addition, it may be possible to replace one 
amino acid with another of similar "type". For instance replacing one hydrophobic 
15 amino acid with another. One can use a program such as the CLUSTAL program to 
compare amino acid sequences. This program compares amino acid sequences and 
fmds the optimal alignment by inserting spaces in either sequence as appropriate. It is 
possible to calculate amino acid identity or similarity (identity plus conservation of 
amino acid type) for an optimal alignment. A program like BLASTx will align the 
20 longest stretch of similar sequences and assign a value to the fit. It is thus possible to 
obtain a comparison where several regions of similarity are found, each having a 
different score. Both types of analysis are contemplated in the present invention. 

In the case of homologues and derivatives, the degree of identity with a protein or 
25 polypeptide as described herein is less important than that the homologue or derivative 
should retain its antigenicity or immunogenicity to streptoccocus pneimioniae. 
However, suitably, homologues or derivatives having at least 60% similarity (as 
discussed above) with the proteins or polypeptides described herein are provided. 



Preferably, homologues or derivatives having at least 70% similarity, more preferably 
at least 80% similarity are provided. Most preferably, homologues or derivatives 
having at least 90% or even 95% similarity are provided. 

In an alternative approach, the homologues or derivatives could be fusion proteins, 
incorporating moieties which render purification easier, for example by effectively 
tagging the desired protein or polypeptide. It may be necessary to remove the "tag** or 
it may be the case that the fusion protein itself retains sufficient antigenicity to be 
useful. 

In an additional aspect of the invention there are provided antigenic fragments of the 
proteins or polypeptides of the invention, or of homologues or derivatives thereof. 

For fragments of the proteins or polypeptides described herein, or of homologues or 
derivatives thereof, the simation is slightly different. It is well known that is possible to 
screen an antigenic protein or polypeptide to identify epitopic regions, ie those regions 
which are responsible for the protein of polypeptide's antigenicity or immunogenicity. 
Methods for carrying out such screening are well known in the art. Thus, the fragments 
of the present invention should include one or more such epitopic regions or be 
sufficiently similar to such regions to retain their antigenic/immunogenic properties. 
Thus, for fragments according to the present invention the degree of identity is perhaps 
irrelevant, since they may be 100% identical to a particular part of a protein or 
polypeptide, homologue or derivative as described herein. The key issue, once again, is 
that the fragment retains the antigenic/immunogenic properties. 

Thus, what is important for homologues. derivatives and fragments is that they possess 
at least a degree of the antigenicity /immunogenicity of the protein or polypeptide firom 
which they are derived. 



V , \ 



Gene cloning techniques may be used to provide a protein of the invention in - 
substantially pure form. These techniques are disclosed, for example, in J. Sambrook 
et al Molecular Cloning 2nd Edition, Cold Spring Harbor Laboratory Press (1989). 
5 Thus, in a fourth aspect, the present invention provides a nucleic acid molecule 
comprising or consisting of a sequence which is: 

any of the DNA sequences set out in Table 1 or their RNA equivalents; 

a sequence which is complementary to any of the sequences of (i); 

a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

a sequence which is has substantial identity with any of those of (i), (ii) 
and (iii); 

a sequence which codes for a homologue, derivative or fragment of a 
protein as defined in Table 1 . 

In a fifth aspect the present invention provides a nucleic acid molecule comprising or 
consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 4 or their RNA equivalents; 

25 

(ii) a sequence which is complementary to any of the sequences of (i); 



=1, 
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(i) 

(ii) 

,(iii) 



15 



(iv) 



(V) 



(iii) a sequence which cedes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which is has substantial identity with any of those of (i), (ii) 
and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a protein 
as defined in Table 4. 

The nucleic acid molecules of the invention may include a plurality of such sequences, 
and/or fragments. The skilled person will appreciate that the present invention can 
includci novel variants of those panicular novel nucleic acid molecules which are 
exemplified herein.... Such variants are encompassed by the present invention. These 
may occur in nature, for example because of strain variation. For example, additions, 
substitutions and/or deletions are included. In addition, and particularly when utilising 
microbial expression systems, "one may wish to engineer the nucleic acid sequence by 
making use of known preferred codon usage in the particular organism being used for 
expression. Thus, synthetic or non-naturally occurring variants are also included within 
the scope of the invention. 

The term "RNA equivalent" when used above indicates that a given RNA molecule has 
a sequence which is complementary to that of a given DNA molecule (allowing for the 
fact that in RNA "U" replaces "T" in the genetic code). 

When comparing nucleic acid sequences for the purposes of determining the degree of 
homology or identity one can use programs such as BESTHT and GAP (both from the 
Wisconsin Genetics Computer Group (GCG) software package) BESTFIT, for 
example, compares two sequences and produces an optimal alignment of the most 



similar segments. GAP enables sequences to be aligned along their whole length and 
finds the optimal alignment by inserting spaces in either sequence as appropriate. 
Suitably, in the context of the present invention compare when discussing identity of 
nucleic acid sequences, the comparison is made by alignment of the sequences along 
5 their whole length. 

Preferably, sequences which have substantial identity have at least 50% sequence 
identity, desirably at least 75% sequence identity and more desirably at least 90 or at 
least 95% sequence identity with said sequences. In some cases the sequence identity 
10 may be 99% or above. 

Desirably, the term "substantial identity" indicates that said sequence has a greater 
degree of identity with any of the sequences described herein than with prior art nucleic 
acid sequences. 

15 

It should however be noted that where a nucleic acid sequence of the present invention 
codes for at least part of a novel gene product the present invention includes within its 
scope all possible sequence coding for the gene product or for a novel part thereof. 

20 The nucleic acid molecule may be in isolated or recombinant form. It may be 
incorporated into a vector and the vector may be incorporated into a host. Such vectors 
and suitable hosts form yet further aspects of the present invention. 

Therefore, for example, by using probes based upon the nucleic acid sequences 
25 provided herein, genes in Streptococcus pneumoniae can be identified. They can then 
be excised using restriction enzymes and cloned into a vector. The vector can be 
introduced into a suitable host for expression. 



Nucleic acid molecules of the present invention may be obtained from S, pneumoniae by 
the use of appropriate probes complementary to pan of the sequences of the nucleic 
acid molecules. Restriction enzymes or sonication techniques can be used to obtain 
appropriately sized fragments for probing. 

5 

Alternatively PGR techniques may be used to amplify a desired nucleic acid sequence. 
Thus the sequence data provided herein can be used to design two primers for use in 
PGR so that a desired sequence, including whole genes or fragments thereof, can be 
targeted and then amplified to a high degree. One primer will normally show a high 
10 degree of specificity for a first sequence located on one strand of a DNA molecule, and 
the other primer will normally show a high degree of specificity for a second sequence 
: 2 located pn the complementary strand of the DNA sequence and being spaced from the 

complementary sequence to the first sequence. 

i 3 15 Typically primers will be at least 15-25 nucleotides long. 

'"••-3: 

=::3 As a further alternative chemical synthesis may be used. This may be automated. 

i=ij Relatively short sequences may be chemically synthesised and ligated together to 

provide a longer sequence. 

|=- 20 

In yet a further aspect the present invention provides an immunogenic/antigenic 
composition comprising one or more proteins or polypeptides selected from those 
whose sequences are shown in Tables 2-4, or homologues or derivatives thereof, 
and/or fragments of any of these. In preferred embodiments, the 
25 immunogenic/antigenic composition is a vaccine or is for use in a diagnostic assay. 

In the case of vaccines suitable additional excipients, diluents, adjuvants or the like 
may be included. Numerous examples of these are well known in the art. 



It is also possible to utilise the nucleic acid sequences shown in Table 1 in the 
preparation of so-called DNA vaccines. Thus, the invention also provides a vaccine 
composition comprising one or more nucleic acid sequences as defined herein. The - 
5 use of such DNA vaccines is described in the art. See for instance, Donnelly et al , 
Ann, Rev, ImmunoL, 15:617-648 (1997). 

As already discussed herein the proteins or polypeptides described herein, their 
homologues or derivatives, and/or fragments of any of these, can be used in methods 
of detecting/diagnosing S.pneumoniae, Such methods can be based on the detection 
of antibodies against such proteins which may be present in a subject. Therefore the 
present invention provides a method for the detection/diagnosis of S. pneumoniae 
which comprises the step of bringing into contact a sample to be tested with at least 
one protein, or homologue, derivative or fragment thereof, as described herein. 
Suitably, the sample is a biological sample, such as a tissue sample or a sample of 
blood or saliva obtained from a subject to be tested. 

In an alternative approach, the proteins described herein, or homologues, derivatives 
and/or fragments thereof, can be used to raise antibodies, which in turn can be used 
to detect the antigens, and hence S, pneumoniae. Such antibodies form another aspect 
of the invention. Antibodies within the scope of the present invention may be 
monoclonal or polyclonal. 

Polyclonal antibodies can be raised by stimulating their production in a suitable animal 
25 host (e.g. a mouse, rat, guinea pig, rabbit, sheep, goat or monkey) when a protein as 
described herein, or a homologue, derivative or fragment thereof, is injected into the 
animal. If desired, an adjuvant may be administered together with the protein. Well- 
known adjuvants include Freund's adjuvant (complete and incomplete) and aluminium 
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hydroxide. The antibodies can then be purified by virtue of their binding to a protein as 
described herein. 

Monoclonal antibodies can be produced from hybridomas. These can be formed by 
fusing myeloma cells and spleen cells which produce the desired antibody in order to 
form an immortal cell line. Thus the well-known Kohler & Milstein technique {Nature 
256 (1975)) or subsequent variations upon this technique can be used. 

Techniques for producing monoclonal and polyclonal antibodies that bind to a 
particular polypeptide/protein are now well developed in the art. They are discussed in 
standard immunology textbooks, for example in Roitt et al. Immunology second edition 
(1989),, Churchill Livingstone, London. 

In addition to whole antibodies, the present invention includes derivatives thereof which 
are capable of binding to proteins etc as described herein. Thus the present invention 
includes antibody fragments and synthetic constructs. Examples of antibody fragments 
and synthetic constructs are given by Dougall et al in Tibtech 12 372-379 (September 
1994). 

Antibodv fragments include, for example. Fab, F(ab')2 and Fv fragments. Fab 
fragments (These are discussed in Roitt et al [supra] ). Fv fragments can be modified 
to produce a synthetic construct known as a single chain Fv (scFv) molecule. This 
includes a peptide linker covalently joining and Vj regions, which contributes to the 
stability of the molecule. Other synthetic constructs that can be used include CDR 
peptides. These are synthetic peptides comprising antigen-binding determinants. 
Peptide mimetics may also be used. These molecules are usually conformationally 
restricted organic rings that mimic the structure of a CDR loop and that include 
antigen-interactive side chains. 




Synthetic constructs include chimaeric molecules. Thus, for example, humanised (or 
primatised) antibodies or derivatives thereof are within the scope of the present 
invention. An example of a humanised antibody is an antibody having human 
framework regions, but rodent hypervariable regions. Ways of producing chimaeric 
antibodies are discussed for example by Morrison et ai m PNAS,' 81, 6851-6855 (1984) 
and by Takeda ei al in Nature. 314, 452-454 (1985). 

Synthetic constructs also include molecules con^rising an additional moiety that 
provides the molecule with some desirable property in addition to antigen binding. For 
example the moiety may be a label (e.g. a fluorescent or radioactive label). 
Alternatively, it may be a pharmaceutically active agent. 

Antibodies, or derivatives thereof, find use in detection/diagnosis of S.pneumoniae. 
Thus, in another aspect the present invention provides a method for the 
detection/diagnosis of S. pneumoniae which comprises the step of bringing into contact 
a sample to be tested and antibodies capable of binding to one or more proteins 
described herein, or to homologues, derivatives and/or fragments thereof. 

In addition, so-called "Affibodies" may be utilised. These are binding proteins 
selected from combinatorial libraries of an alpha-helical bacterial receptor domain 
(Nord et al Thus, Small protein domains, capable of specific binding to different 
target proteins can be selected using combinatorial approaches. 

It will also be clear that the nucleic acid sequences described herein may be used to 
detect/diagnose S. pneumoniae. Thus, in yet a further aspect, the present invention 
provides a method for the detection/diagnosis of S,pneumoniae which comprises the 



step of bringing into contact a sample to be tested with at least one nucleic acid 
sequence as described herein. Suitably, the sample is a biological sample, such as a 
tissue sample or a sample of blood or saliva obtained from a subject to be tested. 
Such samples may be pre-treated before being used in the methods of the invention. 
Trhus, for example, a sample may be treated to extract DNA. Then. DNA probes 
based on the nucleic acid sequences described herein (ie usually fragments of such 
sequences) may be used to detect nucleic acid from S, pneumoniae. 

In additional aspects, the present invention provides: 

(a) a method of vaccinating a subject against S.pneumoniae which comprises the 
step of administering to a subject a protein or polypeptide of the invention, or a 
derivative, homologue or fragment thereof, or an immunogenic composition of the 
invention; 

(b) a method of vaccinating a subject against S.pneumoniae which comprises the 
step of administering to a subject a nucleic acid molecule as defined herein; 

(c) a method for the prophylaxis or treatment of S.pneumoniae infection which 
comprises the step of administering to a subject a protein or polypeptide of the 
invention, or a derivative, homologue or fragment thereof, or an immunogenic 
composition of the invention; 

(d) a method for the prophylaxis or treatment of S.pneumoniae infection which 
comprises the step of administering to a subject a nucleic acid molecule as defined 
herein; 



(e) a kit for use in detecting/diagnosing S.pneumoniae infection comprising one 



or more proteins or polypeptides of the invention, or homologues, derivatives or 
fragments thereof, or an antigenic composition of the invention; and 

(f) a kit for use in detecting/diagnosing S. pneumoniae infection comprising one 
5 or more nucleic acid molecules as defined herein. 

Given that we have identified a group of important proteins, such proteins are 
potential targets for anti-microbial therapy. It is necessary, however, to determine 
whether each individual protein is essential for the organism's viability. Thus, the 
10 present invention also provides a method of determining whether a protein or 

polypeptide as described herein represents a potential anti-microbial target which 
comprises inactivating said protein and determining whether S.pneumoniae is still 
viable, in vitro or in vivo. 

15 A suitable method for inactivating the protein is to effect selected gene knockouts, ie 
prevent expression of the protein and determine whether this results in a lethal 
change. Suitable methods for carrying out such gene knockouts are described in Li et 
al, P,N,A,S.. 94:13251-13256 (1997). 

20 In a final aspect the present invention provides the use of an agent capable of 

antagonising, inhibiting or otherwise interfering with the function or expression of a 
protein or polypeptide of the invention in the manufacture of a medicament for use in 
the treatment or prophylaxis of S, pneumoniae infection. 

25 The invention will now be described with reference to the following examples, 
which should not be construed as in any way limiting the invention. The examples 
refer to the figures in which: 



Figure 1: shows the results of various DNA vaccine trials; and 
Figure 2: shows the results of further DNA vaccine trials. 

EXAMPLE 1 



The Genome sequencing of Streptococcus pneumoniae type 4 is in progress at the 

Institute for Genomic Research (TIGR, Rockville, MD. USA). Up to now, the 
whole sequence has not been completed or published. On 21'^ November 1997, the 
TIGR centre released some DNA sequences as contigs which are not accurate 
reflections of the finished sequence. These contigs can be downloaded from their 
Webster (www(g)titTr.org). We downloaded these contigs and created a local database 
using the application GCGToBLAST (Wisconsin Package Version 9.1, Genetics 
Computer Group (GCG), Madison. USA). This database can be searched with the 
Fast A and TfastA procedures (using the method of Pearson and Lipman {PNAS USA, 
85:2444-2448 (1988)). 

Using Fast A and TfastA procedures, the local pneumococcus database was searched 
for putative leader sequence or anchor sequence features. Relevant sequences were 
used to interrogate for comparative novel sequences. These were: 

(i) already described leader sequences of Streptococcus pneumoniae (from 
proteins NanA, NanB. LytA, PapA. pcpA. PsaA and PspA); 

(ii) the leader sequence of Usp45, a secreted protein from Laaococcus laais\ 



(iii) new hypothetical leader sequences derived from the searches in (i) and 
(ii); 



(iv) the anchor motif LPxTG, a feature common to many Gram-positive 
bacteria surface proteins which are anchored by a mechanism involving the 
Sortase complex proteins. 

Provided below is an example of this approach, with reference to the sequences 
derived from the database (see table 1). 



17 



Usp45 



NanA 



PapA 



PsaA 



PspA 



PrtP 





4121.1 4101.3 4172.4 4106.1 4346.1 4194.1 



4172.2 



4147.2 



4191.6 



4170.8 4175.3 



4137.3 4193.1 4122.1 



4187.2 4188.1 4102.1 



4136.1 4127.1 4191.1 

i 1 



4119.1 



4119.2 



The protein leader sequences of different known exported proteins were used as a 
starting point for a search of the local pneumococcus database described above. The 
hypothetical proteins found with this search were then submitted to a Blast search in 
general databases such as EMBL, Swissprot etc. Proteins remaining unknown in the 
pneumococcus are kept and annotated. Then the search is performed again using the 
new potential protein leader sequence as a probe, using the TfastA procedure. 



Example 2: DNA vaccine trials 
pcDNA3.1-h as a DNA vaccine vector 
DCDNA3.1 + 

The vector chosen for use as a DNA vaccine vector was pcDNAS, 1 (Invitrogen) 
(actually pcDNA3.1 + , the forward orientation was used in all cases but may be 
referred to as pcDNAS.l here on). This vector has been widely and successfully 
employed as a host vector to test vaccine candidate genes to give protection against 
pathogens in the literature (Zhang, et aL, Kurar and Splitter, Anderson etal). The 
vector was designed for high-level stable and non-replicative transient expression in 
mammalian cells. pcDNAS.l contains the ColEl origin of replication which allows 
convenient high-copy number replication and growth in E. coli. This in turn allows 
rapid and efficient cloning and testing of many genes. The pcDNAS.l vector has a 
large number of cloning sites and also contains the gene encoding ampicillin 
resistance to aid in cloning selection and the human cytomegalovirus (CMV) 
immedikte-early promoter/enhancer which permits efficient, high-level expression -of 
the recombinant protein. The CMV promoter is a strong viral promoter in a wide 
range of cell types including both muscle and immune (antigen presenting) cells. 
This is important for optimal immune response as it remains unknown as to which 
cells types are most important in generating a protective response in vivo. A T7 
promoter upstream of the multiple cloning site affords efficient expression of the 
modified insert of interest and which allows in vitro transcription of a cloned gene in 
the sense orientation. 

Zhang, D., Yang, X-, Berry, J. Shen, C, McClarty, G. and Brunham. R.C, (1997) 
"DNA vaccination with the major outer-membrane protein genes induces acquired 
immunity to Chlamydia trachomatis (mouse pneumonitis) infection". Infection and 
Immunity, 176, 1035-40. 

Kurar, E. and Splitter, G.A. (1997) "Nucleic acid vaccination of Brucella abortus 
ribosomal L7/LI2 gene elicits inunune response". Vaccine, 15, 1851-57. 

Anderson, R., Gao, X.-M., Papakonstantinopoulou, A., Roberts, M. and Dougan, 
G. (1996) "Immune response in mice following immunisation with DNA encoding 
fragment C of tetanus toxin". Infection and Invnunity, 64, 3168-S17S. 

Preparation of DNA vaccines 



Oligonucleotide primers were designed for each individual gene of interest derived 
using the LEEP system. Each gene was examined thoroughly, and where possible, 




primers were designed such that they targeted that portion of the gene thought to 
encode only the mature ponion of the gene protein. It was hoped that expressing 
those sequences that encode only the mature portion of a target gene protein, would 
facilitate its correct folding when expressed in mammalian cells. For example, in the 
5 majority of cases primers were designed such that putative N-terminal signal peptide 
sequences would not be included in the final amplification product to be cloned into 
the pcDNAS.l expression vector. The signal peptide directs the polypeptide 
precursor to the cell membrane via the protein expon pathway where it is normally 
cleaved off by signal peptidase I (or signal peptidase II if a lipoprotein). Hence the 
10 signal peptide does not make up any part of the mamre protein whether it be 

displayed on the surface of the bacteria surface or secreted. Where a N-terminal 
leader peptide sequence was not immediately obvious, primers were designed to 
target the whole of the gene sequence for cloning.and ultimately, expression in 
pcDNA3.1. 

15 

Having said that, however, other additional features of proteins may also affect the 
expression and presentation of a soluble protein. DNA sequences encoding such 
features in the genes encoding the proteins of interest were excluded during the 
20 design of oligonucleotides. These features included: 

1. LPXTG cell wall anchoring motifs. 

2. LXXC ipoprotein attachment sites. 

3. Hydrophobic C-terminal domain. 

25 4. Where no N-terminal signal peptide or LXXC was present the stan codon was 
excluded. 

5. Where no hydrophobic C-terminal domain or LPXTG motif was present the stop 
codon was removed. 

30 Appropriate PCR primers were designed for each gene of interest and any and all of 
the regions encoding the above features was removed from the gene when designing 
these primers. The primers were designed with the appropriate enzyme restriction 
site followed by a conserved Kozak nucleotide sequence (in all cases) GCCACC was 
used. The Kozak sequence facilitates the recognition of initiator sequences by 

35 eukaryotic ribosomes) and an ATG start codon upstream of the insert of the gene of 
interest. For example the forward primer using a BamHl site the primer would 
begin GCGGGATCCGCCACCATG followed by a small section of the 5* end of the 
gene of interest. The reverse primer was designed to be compatible with the forward 
primer and with a Notl restriction site at the 5* end in all cases (this site is 

40 TTGCGGCCGC). 

PCR primers 



The following PCR primers were designed and used to amplify the truncated genes 
of interest. 



ID210 

Forward Primer 5* CGGATCCGCCACCATGTCTTCTAATGAATCTGCCGATG 
3' 

Reverse Primer 5" TTGCGGCCGCCTGTTTAGATTGGATATCTGTAAAGACTT 

3" 

4172.5 

Forward Primer 5' 

CGCGG ATCCGCCACC ATGG ATTTTCCTTC A A ATTTGGAGG 3 ' 
Reverse Primer 5' TTGCGGCCGCACCGTACTGGCTGCTGACT 3' 

f 

ID211 

Forward Primer 5' 

CGGATCCGCC ACCATGAGTGAGATCAAAATTATTAACGC 3 ' 
Reverse Primer 5' TTGCGGCCGCCGTTCCATGGTTGACTCCT 3' 

4197.4 

Forward Primer 5' CGCGGATCCGCCACCATGTGGGACATATTGGTGGAAAC 

3' 

Reverse Primer 5* TTGCGGCCGCTTCACTTGAGCAAACTGAATCC 3' 
4122.1 

Forward Primer 5' 

CGCGGATCCGCCACCATGTCACAAGAAAAAACA AAAAATGAA 3 ' 
Reverse Primer 5* TTGCGGCCGCATCGACGTAGTCTCCGCC 3" 

4126.7 

Forward Primer 5' 

CGCGGATCCGCCACCATGCTGGTTGGAACTTTCTACTATCAAT 3' . 
Reverse Primer 5' TTGCGGCCGCAACTTTCGTCCCTTTTTGG 3' 




4188.11 

Forward Primer 5" CGCGGATCCGCCACCATGGGCAATTCTGGCGGAA 3' 
Reverse Primer 5' TTGCGGCCGCTTGTTTCATAGCTTTTTTGATTGTT 3' 

5 

ID209 

Forward Primer 5' 

CGCGGATCCGCCACCATGCTATTGATACGA AATGCAGGG 3 ' 
10 Reverse Primer 5' TTGCGGCCGCAACATAATCTAGTAAATAAGCGTAGCC 3' 

ID215 

Forward Primer 5' CGCGGATCCGCCACCATGACGGCGACGAATTTTC 3' 
1 5 Reverse Primer 5 ' TTGCGGCCGCTTA ATTCGTTTTTGA ACT AGTTGCT 3 ' 

4170.4 

Forward Primer 5' 

20 CGCGG ATCCGCC ACC ATGGCTGTTTTTCTTCGCTATCATG 3 ' 

Reverse Primer 5' TTGCGGCCGCTTTCTTCAACAAACCTTGTTCTTG 3' 

4193.1 

25 Forward Primer 5' 

CGCGG ATCCGCCACCATGGGTAACCGCTCTTCTCGTAAC 3 ' 
Reverse Primer 5' TTGCGGCCGCGCTTCCATCAAGGATTTTAGC 3* 

Cloning 

30 

The insert along with the flanking features described above was amplified using PGR 
against a template of genomic DNA isolated from type 4 5. pneumoniae strain 11886 
obtained from the National Collection of Type Cultures. The PCR product was cut 
with the appropriate restriction enzymes and cloned in to the multiple cloning site of 

35 pcDNA3.1 using conventional molecular biological techniques. Suitably mapped 
clones of the genes of interested were cultured and the plasmids isolated on a large 
scale (> 1.5 mg) using Plasmid Mega Kits (Qiagen). Successful cloning and 
maintenance of genes was confirmed by restriction mapping and sequencing — 700 
base pairs through the 5' cloning junction of each large scale preparation of each 

40 construct. 



Strain validation 
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30 



35 




A strain of type 4 w as^used in cloning and challenge methods which is thg^ 
r4r6m which the S. pneumoniae genome was sequenced. A freez 
homogeneous laboratory strain of type 4 S, pneumoniae strainV 
5 obtained from the National CollecSon of Type Strains. The ampule was 

the cultured re suspended with 0.5 ml of tryptic soy broth (0.5%^tiieose75% 
blood). The suspension was subcultured into 10 ml tryptic soy broth (0.5% glucose, 
5% blood) and incubated statically overnight at 37**C. This culture was streaked on 
to 5 % blood agar plates to check for contaminants and confirm viability and on to 
10 blood agar slopes and the rest of the culture was used to make 20% glycerol stocks. 
The slopes were sent to the Public Health Laboratory Service where the type 4 
serotype was confirmed. 

A glycerol stock of NCTC 1 1886 was streaked on a 5% blood agar plate and 
15 incubated overnight in a C02 gas jar at ST'C, Fresh streaks were made and optochin 
sensitivity was confirmed. 

Pneumococcal challenge 

20 A standard inoculum of type 4 S. pneumoniae was prepared and frozen down by 

passaging a culture of pneumococcus Ix through mice, harvesting from the blood of 
infected animals, and grown up to a predetermined viable count of around 10^ 
cfu/ml in broth before freezing down. The preparation is set out below as per the 
flow chart. 



Streak pneumococcal culture and confirm identity 

I 

V 

Grow over-night culture from 4-5 colonies on plate above 

I 

V 

Animal passage pneumococcal culture 
(i.p. injection of cardiac bleed to harvest) 



40 



Grow over-night culture from animal passaged pneumococcus 



10 



20 



25 



I 

V 

Grow day culture (to pre-determined optical density) from over-night of animal 
passage and freeze down at -70 "C - This is standard minimum 

I 

V 

Thaw one aliquot of standard inoculum to viable count 



15 . - 

Use standard inoculum to determine effective dose (called Virulence Testing) 

I 

V 

All subsequent challenges - use standard inoculum to effective dose 

An aliquot of standard inoculum was diluted 500x in PBS and used to inoculate the 
mice. 

Mice were lightly anaesthetised using halothane and then a dose of 1.4 x lO^ cfu of 
pneumococcus was applied to the nose of each mouse. The uptake was facilitated by 
the normal breathing of the mouse, which was left to recover on its back. 

30 5. pneumoniae vaccine trials 

Vaccine trials in mice were carried out by the administration of DNA to 6 week old 
CBA/ca mice (Harlan. UK). Mice to be vaccinated were divided into groups of six 
and each group was immunised with recombinant pcDNA3.1 + plasmid DNA 

35 containing a specific target-gene sequence of interest. A total of 100 /ig of DNA in 
Dulbecco's PBS (Sigma) was injected intramuscularly into the tibialis anterior 
muscle of both legs (50 fi\ in each leg). A boost was carried using the same 
procedure 4 weeks later. For comparison, control groups were included in all 
vaccine trials. These control groups were either unvaccinated animals or those 

40 administered with non-recombinant pcDNA3. 1 -h DNA (sham vaccinated) only, 

using the same time course described above. 3 weeks after the second immunisation, 
all mice groups were challenged intra-nasally with a lethal dose of S, pneumoniae 




serotype 4 (strain NCTC 11886). The number of bacteria administered was 
monitored by plating serial dilutions of the inoculum on 5% blood agar plates. A 
problem with intranasal immunisations is that in some mice the inoculum bubbles out 
of the nostrils, this has been noted in results table and taken account of in 
5 calculations, A less obvious problem is that a certain amount of the inoculimi for 
each mouse may be swallowed. It is assumed that this amount will be the same for 
each mouse and will average out over the course of innoculations. However, the 
sample sizes that have been used are small and this problem may have significant 
effects in some experiments. All mice remaining after the challenge were killed 3 or 

10 4 days after infection. During the infection process, challenged mice were monitored 
for the development of symptoms associated with the onset of S. pneumoniae 
induced-disease. Typical symptoms in an appropriate order included piloerection, 
an increasingly hunched posture, discharge from eyes, increased lethargy and 
reluctance to move. The latter symptoms usually coincided with the development of 

15 a moribund state at which stage the mice were culled to prevent further suffering. 

These mice were deemed to be very close to death, and the time of culling was used 
to determine a survival time for statistical analysis. Where mice were found dead, 
the survival time was taken as the last time point when the mouse was monitored 
alive. 

20 

Interpretation of Results 

A positive result was taken as any DNA sequence that was cloned and used in 
challenge experiments as described above which gave protection against that 

25 challenge. Protection was taken as those DNA sequences that gave statistically 

significant protection (to a 95% confidence level (p<0.05)) and also those which were 
marginal or close to significant using Mann-Whitney or which show some protective 
features for example there were one or more outlying mice or because the time to the 
first death was prolonged. It is acceptable to allow marginal or non-significant results 

30 to be considered as potential positives when it is considered that the clarity of some of 
the results may be clouded by the problems associated with the administration of 
intranasal infections. 
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Table 1 



4101.1 / 

ATGGAAGAGTTAGTGACCTTAGATTGTTTGTTTATTGACAGAACrAAGATTGAAGCCAATGCC^ 

TTGTGTGGAAGAAAACGACAGAGAAATTCrCCGCCAAACTTCAAGAACAGATACAGGTCTATTTTCA^ 

TCACTCCCCTTCTGATTAAATATGCCATGrrTGATAACAAACAAAAGAGAGGGTATAAAGACTC AGCT Ay^^ 

TAGCGAATTGGCACTATAATGACAAGGAGGATAGCTACACACATCCTGATGGCTGCTATT^ATCiC 

CAAATATCAG^\AACACAGACAGACITTCAACAAGAAATCAAGGTTTACTACGCCGA(;G/^ 

TCAAAAGGGACTGTATATGAACGAACGCTATCAAAACTTGAAAGCTAAAGAATG^ 

AGGTAGACAGATTTTCGCTCAACGCAAGATrGATCTGGAACCTGTCTTTGGGCAG^^A^ 

AAGAGATGTAATCTGAGAGGGAAGCGTCAAGTGAGAATTGACATGGGATTGGT^k^TATGGCCAATAACCTCCTA 
AAATATAGTAAAATGAAATAA 



4101.3 , , 

15 ATGGGGAAAGGCCATTGGAATCGGAAAAGAGTTTATAGCATTCGTAACm'GCTGTGGGACCTT^ 
ATTGGGACTTGTGCAGTTTTATTAGGAGGAAATATAGCTGGAGAATCIWAGTT^ 
CTCATACTGCTGAGAAACCTAAAGAGGAAAAAATGATAGTAGAAGi^^ii^ 
AATATACTTGAAACGACAGAACAAAGTGAACCTAGTTCAACrGjySqCT 

GCCGTAACTCCAAAAGAGGAAAAAGTGTCTGCTAAACCGGAAGAAAAAGCTCCAAGGATAGAATC 
20 AAATCAAGAAAAACCGCTCAAGGAAGATGCTAAAGCTGTAAG^^TGAAGAAGTGAATCAAA 
GGAAAGTGGATTTTAATCAAAATTGGTACTTTAAACTCAATOC!^^ 
ACGTATCTACGTGGAAAAAATTAGATTTACCGTATGACTG9AGTATCTTTAACGArr^ 
ACAAAj\TGAAGGTGGACAGCTCAj\CGGTGGGGAAGCTT;GGTATCGCAAGACTTTCAAACT 

CAAG AAAAATGTTCGCCTTA C ITU G ATGGCGTCTAC ATOG ATTCTCAAGTTTATGTCAATGGTCAGTTAGTGGGG 
25 CATTATCCAAATGGTTATAj\CCAGTTCTCATATGATA5<5aCCAAATACCTTCAAA^ 

ttgctgtccatgcagtcaacaaacagcc^agtagqdentggtat^ 

acaagtgacagataaggtgcatgttgagaaaaatoggacaactattttaacaccaaaacttgaagaao^ 
tggcaaggttgaaactcatgtgaccagcaaaatcdtcaatacggacgacaaagaccatgaact^ 

TCAAATCGTTGAACGAGGTGGTCATGCTGTAAC^^ 
30 ATCAACAAGCCTAGATGCGATTTTAGAACTX^Ou^GACCAAAACTCTGGACTU I I 1 iAAATGACAAACCTGCCTTG 

TACGAATTGATTACGCGTCTTTACCGTGACG/rrCAATTGCTTGATGCTAAGAAGGAl 1 lUl 1 1 GCTTACCCTTACT 
ATCACTGGACTCCAAATGAAGU ni CTCnTTGAATGGTGAACGTATTAAATTCCATG^ 
CCATGGGGCGCTTGGAGCAGAAGAAAA<n^\TAAAGCAGAATATCGCCGTCTCAAACAAATGA^ 

ttaactccatccgtacaacccacaaccctgctagtgagcaaaccrrgcaaatcgcagcagaactagg^ 
35 tcaggaagaggcctttgatacgtggtatcgtggcaagaaaccttatgactatggacgrric'l i igaaaaagatgc 
cactcacccagaagctcgaaaaggx9vwvaaatggtctgattttgacct 
caj\ccctgctatcitcatgtggtca^atggtaatgaaatagctg^ 

gttaaacgtttggttaaggttatc^ggatgttcataagactcgctatgttaccatgggagcagataaat^ 
tcggtaatggtagcggagggcatgagaaaattgcrgatgaactcgatgctgttggatttaactattctgaagata 

40 attacaaagcccttagagctaagcatccaaaatggttgattt 

tggaagttactatcgccctga^)cgtgaattgaaacatagcaatggacctgagcgt^ 
tggaaatgatcgtgtgggttg^gggaaaacagcaaccgcttcatgga ci ' ri i gaccgtgacaacgctggct atgc 
tggacagtttatctggacagotacggactatattggtgaacctacaccatggcacaaccaaaatcaaactccrgtt 
aagagctcttactttggtatggtagatacagccggcattccaaaacatgacttctatct 

45 tttctgttaagaj^gaaacggfatggtacaccttcttcctc 

tagctgactcagaagg tayga ttccagttcgtgcttattcgaatgcntcr 
atctcttggtcttaagacmtcaataaaaaacaaaccagcgatgggcggact^ 
tgaactttatcttgaat<maaagttgcctatcaaccaggtaccrtggaagc^ 
gaaattgctcgagataagattacgac^ 

50 gcagcagatggaaaagacttgacrtacatctactatgaaattgttgacagccaggggaatgtggt^ 

aataatctcgttcgqfrccaattgcatggccaaggtcaactggtcggtgtagataaccgagaacaagccagccgt 
gaacgctataaggccjcaagcag atgg y\ ci 1 gg attcgtaaagcatttaatggtaaaggtcttgccattg tcaa at 
caactgaacaagcvoggaaattcaccctgactccccactct 

tggtaagaaagaaggacaagagaagactgttttggggacagaagtgccaaaagtacagaccattattggaga 
55 cacctgaaatgccmccactgttccgtttgtatacagtgatggtagccctgcagaacctcct 

agtagatgtgagc^aagcctggtattgtaacggtgaaaggtatggctgacggacgagaagtagaagctcgtgtaca 
agtgattgctctt^jaatcagagctaccagttgtgaaacgtat^ 

tctgtttcctatciyrrigattgatggaagtgttgaagagtatgaagtggacaagtgggagattgccgaagaagata 
aagctaagttagaaattccaggttctcgtattcaagcgaccggttatttagaaggtcaaccaattcatg 
60 tgtggtagaagaaggcaatcctgcggcacctgcagtaccaactgtaacggtrggtcgtgaggcagtaacaggtct 
tactagtcaaa/saccaatgcaataccgcactcttgcttatggagc^ 

aaatgcagctgtfracagttcttcaagcaagcgcagcaaacgccatgcgtgcgagcatcrrtattcagcct^^ 
tggtggccctcjttcaaaccratgcaattcaatrccttgaagaagcgccaaaaatt 

gaaaaagctgKc:agtctcaaagaagaccaaactgtcaaattgtcggttcgagctcactatcaagatggaacgcaa 



/ 



30 



GCTGTATTACCAGCTGATAAAGTAACCTTCTCTACAAGTGGTGAAGGGGAACTCGCAATTCGTAAAGG^ 
GAGTTGCATAAGCCAGGAGCAGTCACTCTGAACGCTGAATATGAGGGAGCTAAAGACCAAGTTGA.'\CrCACT 
CAAGCCAATACTGAGAAGAAGATTGCGCAATCCATCCGTCCTGTAj\ATGTAGTGACAGATTTCCATCAGGAACCA 
AGTCTTCCAGCAACAGTAACAGTTGAGTATGACAAAGGTTTCCCTAAAACTCATAj^AG^ 
5 CGAAAGAAAAACTAGACTCCTATCAAACATTTGAAGTACTAGGTAj\AGTTGAAGGAATTGACCTTGA^ 

CAAAAGTCTCTGTAGAAGGTATCGTTTCAGTTGAAGAAGTCAGTGTGAC^ACTCCAATCGCAGAAGCACCAC^ 
TACCAGAAAGTGTTCGGACATATG ATTCA AATGGTCACGTTTCATCAGCTAAGGTTGCATGGCATGCGATTCGTCC 

agagcaatacgctaaggaaggtgtctttacagttaatggtcgcttagaaggtacgcaattaacaacta;^ 
tgttcgcgtatctgctcaaactgagcaaggtgcaaacatttctgaccaatggaccggttcagaattgccact^ 
10 tttgcttcagactcaaatccaagcgacccagtttcaaatgttaatgacaagctcatttcct 

ccaatcgttggacaaactggaatcgtactaatccagaagcttcagtcggtgttctgtttggagattcagg^ 
gagcaaacgctccgttgataatctaagtctcggattccatgaagaccatcgagttggtgtaccgaactcttatgtg 
attg agtattatcttggtaag actgtcccaacagctcctaaaaacccta g 1 r n gttggtaatg agg accatgtct 

TTAATGATTCTGCCAACTGGAAACCAGTTACTAATCTAAj\AGCCCCrGCTCAACTCAAGGCTGGAGAAATGAACC 
15 ACTTTAGCTTTGATAAAGTTGAj\ACCTATGCTGTTCGTATTCGCATGGTTAAAGCAGATAAC^ 

TATCACAGAGGTACAAATCTTTGCGAAACAAGTTGCGGCAGCCAAGCAAGGACAAACAAGAATCCAACTTCACGG 
CAAAGACrrAGCAAACTTCAj\CCCTGATTTGACAGACTACTACCTTGAGTCrGTAGATGGAAAA^^ 
ACAGCAAGTGTTAGCAACAATGGTCTCCCTACCGTCGTTCCAAGCGTTCGTGAAGGTGAGCCAGTTCGTGTCATCG 
CGAAAGCTCAAAATGGCGACATCTTACGAGAATACCGTCTGCACTTCACTAAGGATAAGAGCnTACTTTC^ 
20 AACCAGTTGCTGCGGTTAAACAAGCTCGCITGCTACAj\GTAGGTCAAGCACTTGAATTGCCGACTAAGGT^ 

TTACTTCACAGGTAAAGACGGCTACGAAACAAAAGACCTGACAGTTGAATGCGAAGAAGTTCCAGCGGAAAATCT 
GACAAAAGCAGGTCAATTTACTGTTCGAGGCCGTGTCCTTGGTAGTAACCTTGTTGCTGAGATCACTGTAC^ 
ACAGACAAACTTGGTGACACICU riCAGATAACCCTAACTATGATGAAAACAGTAACCAGCCCTTTGC^ 
■ ACCAATGATATTGACAAAAACTCTCATGACCGCCTTGACTATCTCAATGACGGAGATCATTCAGAAAATCGTCGTT 
25 GGACAAj\CTGGTCACCAACACCATCTTCTAATCCAGAAGTATCAGCCGGTGTGATTTTCCCTGAAj^ 

ttgtagaacggactgttacacaaggaaaagttcagttctttgcagatagtggtacggatgcaccatctaaactcgt 

tttagaacgctatgtcgctccagactttgaagtgccaacctactattcaaactaccaagcctacgacgcagacc^ 

ccattcaacaatccagaaaattgggaagctgttccttatcctgccgataaagacattgcagctggtgatgaa^ 

AACCTAACATTTAAAGCTATCAAAGCCAAAGCTATGAGATGGCGTATGGAGCGTAAAGCAGATAAGAGCGGTGTT 
30 GCGATGATTGAGATGACCTTCCTTGCACCAAGTGAATTGCCTCAAGAAAGCACTCAATCAAAGATTCTTGTAGATO 
GAAAAGAACTTGCTGATTTCCCTGAAAATCGTCAAGACTATCAAATTACCTATAAAGGTCAACGGCCAA^ 
CAGTTGAAGAAAACAj\TCAAGTAGCrrCAACTGTGGTAGATAGTGGAGAAGATAGCTTTCCAGTACTT 
CGTTTCAGAAAGTGGAAAACAAGTCAAGGAATACCGTATCCACTTGACTAAGGAA 

AAACCAG r 1 1 CTGAG AAGACAGTTGCTGCTGTACAAGAAGATCTTCCAAj^AATCGAATTTGTTGAAAAA^ 
35 GCATACAAGACAGTTGi^AAAAAAGATTCAACACTGTATCTAGGTGAAACTCGTGTAGAA 

GGAAAAGAACGTATCTTTACAGCGATTAATCCTGATGGAAGTAAGGAAGAAAAACTCCGTGAAGTGGTAGAAG^ 

ccgacagaccgcatcgtcttggttggaaccaaaccagtagctcaagaagctaaaaaaccacaagtgtcagaaaaa 
gcagatacaaaaccaattgattcaagtgaagctagtcaaactaataaagcccagttaccaagtacaggtagtgcg 
gcaagccaagcagcagtagcagcaggtttaactcttctagctttgagtgcaggattagtagttactaaaggtaaa 
40 aaagaagactag 



4101-5 

ATG GATG CAATCTTTGAC CTAATCGGAA AGGriU'rCAATCCCATCTTAGAAATGGGTGGACCTGTCATCATGTT^ 
45 TCATTTTGACAGTATTGGCTTTACl l 1 I IGGAGTGAAATTCTCCAAAGCGCTTGAAGGTCGTATCAAACTTGCCAT 

CGCTCTTACAGGTATCGGTGCrATCATCGGTATGCTAAACACTGCriTCTCAGCATCACTAGCAAAATTCGTTC 
AACACTGGTATCCAATTGAGTATTACCGACGTTGGTTGGGCACCACTTGCTACAATCACTTGGGGTTCTGC^ 

cactat actt cttgctcatcatgttgattgtcaacatagtgatgctagctatgaagaaaacagatacacttgatgt 
cgatat ctttg ata tctg gcacttgtctatcacaggtctcttgattaaatggtatgctgataacaatggt 

50 caaggggt7tcactctttattgctacagcagctatcgtccttgtcggtgtgttgaaaattatcaact 

gaaacctacatttgatgaccttcrtaacgccccaagttcatcaccaatgacatcaactcacatgaact^ 
aacccagttatcatggttttggataagal i rmgaaaaattcttcccaggccttgataaatatgactttgatgctg 
ctaaattgaacaagaaaatcggtttctggggatctaaattcttcatcggtttcatccttcgtatcgt^ 
atgggaactccacatccaat tgcag gtgttgcagatgcagataaatggcgt c rr g l-iatcaaaggatggttgtctc 

55 ttggtttgactgccggtgtatctttggaactcttctcacttatcggttcatggttcatcgca 

tcacaaggtattacaaacgttgctactaaacctcttcaaggacgtaaattcaatatcggtcttgactggcc^rrc^ 
tcgctggtcgtgctgaaatctgggcttgtgccaacgtacttgcaccaatcatgttgattgaagcagtg cllcrri c 
aaaacttggaaatggtatcttgccacttgcaggtatcatcgctatgggtgttactccagctctcttggttgtaact 
cgtgctaaattgctccgtatg attatcttcgg aacactcttgttgccactcttc cr i ' cl ■ i ' i ' caggtacacttattgc 

60 accatttgcaacagaacttgctaaaggtgtaggtgccttcccagaaggtgtgagccaaactcaattgattact 
tctactcttgaaggaccaatcgaaaaacttcttggttggacaattggtaacactacaacrggtgatatca^ 
tccttcgtgcagtagtcttccttgtattctatatcggtatctttgcttgctacagaaaacaaatgatc^^ 
cgaagagtacgcagcaaaagcaaaataa 

65 4102.1 



31 



ATGAAGATTATGAAAAAAAAATATTGGACTTTAGCGATATTATT CI lll G l M OTl CAATAATTCTGTTACTGCTCA 
AGAAATACCTAAAAATCTTCATGGCAATATAACrCACACTCAGACrAGCGAAAGTTTTT 
ACAGGTTGACTATTCTAATAAAAATCAAGAAGAAGTAGACCAAAATAAATrrCCTATTCA^ 
ATTATTTGTAACAACAGATAAACATTTAGAAAAAAACTGTTGTAAATTGGAACTTG 
5 TATTGTTAACrCTGAAAGTAATAATTTACrAGGCGAAGATAATTTAGATAATAAAATTAAGGAAA^ 

CTAGATAATAGAGGAGGAAATATAGAGCATGACAAAGATAACTTAGAATCGTCGATTGTAAGAAAATATGAATGG 
GATATAGATAAAGTTACTGCTGGAGGCGAAAGTTATAAATTATATTCTAAj\AGTAAT^ 
TTTTAGATTCAGGAGTCGArrrACAAAATACTGGATTACTGAAAAATCTTTCAAATCACTCAAA 
CAATAAAGGATATTTAGGAAAAGAGGAGGGAGAGGAAGGAATAATATCAGATATTCAAGATAGATTAGGTCATG 

10 GTACGGCTGTTGTAGCTCAAATTGTAGGGGATGACAATATTAATGGAGTAAATCC^ 

TAGAATATTTGGTAAGTCGTCAGCTACTCCAGATTGGATTGTAAAAGCAATTTTTGATGCTGTA 
G ATATTA TCAATCTTAGTACTGGACAATATTTAATGATTGATGGAGAATATGAGGACGGAACAAATGATTT^ 
CATTTTTGAAGTATAAAAAGGCTATTGATTACGCGAATCAAAAAGGAGTAATTATAGTAGCTGCATTAGGGA^ 
ACTCCCTAAATGTATCAAATCAGTCAGATTTATrGAAACTTATTAGTTCACGCAAAAAAGTi^ 

1 5 AGTAGTTGATGTT CCAA GTTATTTCTCATCTACAATTTCGGTCGGAGGCATAGATCGCTTAG 

GTAATTTATCAGATTTTACCAATAAAGGGGATTCTGATGCAATATATGCGCCTGCAGGCTCA^ 
AGAATTAGG ACTTA ATAACTTTATTAATGCAGAAAAATATAAAGAAGATTGGA l i ri Tl CGGCAACACTAGGAGG 
ATATACGTATCTTTATGGAAACrCATTTGCTGCrCCTAAACTTTCTGGTGCGATTGC^ 
AATTAAAAGATCAGCCCTATAATTATATGTTTGTAAAAAAATTCTGGAAGAAACATTAC^ 

4106.1 

ATGAAGAAAACATGGAAACTGTTTTTAACGCTTGTAACAGCTCTTGTAGCTCTTGTGC^ 

GAACTGCTTCTAAACACAACAAAGAGGCAGAACTTAAGAAGCTTGACTTTATCCT 

ACCACACAGGGC TTTA TGTTGCCAAGGAAAAAGGTTATTTCAAAGAAGCTGGAGTGGATGTTGATT^ 

25 CACCAGAAGAAAG 1 iCl lCTGA(nTGGTTATCAACGGAAAGGCACCATTTGOVGTGTATTTCCAAGACTACATGGC 

TAAGAAATTGGAAAAAGGAGCAGGAATCACTGCCGTTGCAGCTATTGTTGAACACAATACATCAGGAATCATCTC 
TCGTAAATCTGATAATGTAAGCAGTCCAAAAGaCTTGGTTGGTAAGAAATATGGGACATGGAATGACCCAACTGA 
ACTTGCTATGTTGAAAACCTTGGTAGAATCTCA AGGT GGAGACTTTGAGAAGGTTGAAAAAGTACCAAAT;^ 
CTCAAACTCAATCACACCGATTGCCAATGGCGTCnTTGATACTGCTTGGATTTACTACGGTTGCGATGGTATC 

30 GCTAAATCTCAAGGTGTAGATGCTAACTTCATGTACTTGAAAGACTATGTCAAGGAGTTTGACTACTATTCACC^ 
TTATCATCGCAAACAACGACTATCTGAAAGATAACAAAGAAGAAGCTCGCAAAGTCATCCAAGCCATCAAAAAAG 
GCTACCA ATAT GCCATGGAACATCCAGAAGAAGCTGCAGATATTCTCATCAAGAATGCACCTGAACTCAAGGAAA 
AACGTGACTTTGTCATCCAATCTCAAAAATACTTGTCAAAAGAATACGCAAGCGACA^ 
TTGACGCAGCTCGCTGGAATGCTTTCTACAAATGGGATAAAGAAAATGGTATCCTTAAAGAAGACT^ 

35 AAGGCTTCACCAACGAATTTGTGAAATAA 

4106.4 

ATGATAAAAAA TCCT AAATTA TTAA CCAAGTCI 11 11 i' AAGAAGTTTTGCAATTCTAGGTGGTGTTGGTCTAGTCA 
- TTCATATAGCTATTTATTTGACCTTTCCTrTTTATTATATTCAACTGG AGGCGG A^ 

40 agtgtttacggagtatttaaagactaagacatctgatgaaattccaagcrractccagtcttattca^ 

accatatctgctcaccttaaaagagatattgtagataagcggctccctcttgtgcatgacttggatattaaagatg 

GAAAGCTATCAjaLATTATATCGTGATGTTAGATATGTCTCTTAGTACAGCAGATGGTAAACAGGTAACCGTGCAATT 
TGTT CACGGGGTGGATGTCTA CAAAGAAGCAAAGAATATTTTGCTTTTGTATCTCCCATATACATTT^ 
ATTGCl rrriCCTTTGl i 1 i llCl'lATTTTTATA CTAA ACGCTTGCTCAATCCT ClU ' iUUT ACATTTCAGAAGTGACT 
45 AGTAAAATGCAAGATTTCGATGACAATATTCGTTTTGATGAAAGTAGGAAAGATGAAGTTGGTGAACTTGGA 

CAGATTAATGGTATGTATG AGCA CTTGTTGAAGGTTATTTATGAGTTGGAAAGTCGTAATGAGCAAATTGTAAA^ 

tgcaaaatcaaaaggtitcctttgtccgcggagcatcacatgagttgaaaacccctttagccagtcrrag 
cctagagaatatgcagcataatattggagattacaaagatcatccaaaatatattgcaaagagtataaataagat 
tgaccagatgagccac rratt agaagaa gtac tggagtcttctaaattccaagagtggacagagtgtcgtgagac 
50 cttgactgttaagccagttttagtagatattttatcacgttatcaagaattagctcattcaat^ 

aaaatcaattgacagatgctaccagggtcgtcatgagtcttagggcattggataaggttttgacaaacctgatta 

GTAATGCAATTAAATATTCAGATAAAAATGGGCGTGTAATCATATCCGAGCAAGATGGCTATCTCTCTATCAAAA 
ATACATGTGCGCCTCTA AGTCA CCA AGAA CTAGAACATTTATTTGATATATTCTATCATTCTCAAATCGTGACAGA 
TAAGGATGAAAGTTCCGGTTTGGGTCTTTACATTGTGAATAATATTTTAGAAAGCTATCAj^ 
5D CTCCCTTATGAACACGGTATGGAATTTAAGATTAGCTTGTAG 

4106.6 

.ATGTATTTAGGAGATTTGATGGAGAAAGCCGAGTGTGCTCAATTTTCAATAC^ 
AGACGACCG TCAA GGCTGTAATGGAAGAAACaGGATTTTCAAAAGCAACCCTAACCAAATATGTCACCCTGCTCA 
00 ATGACAAGGCTTTGGATaGTGGCTTAGACCTGGCTATTCACTCAGAAGATGAAAATCTGCCTCTGTCTATCGGTGC 
AGCTACCAAGGG GAGAG ATATTCGGAGCTTGTITTTGGAGAGTGCTGTTAAATACCAGATTTTGGT^ 
TAC CACC AACAGTT rTTAG CCCATCAGCTGGCTCAAGAATTGGTGATTAGCGAGGCTACGCTTGGTCGTCAC^ 
CTGGTTTAAATCAGATTTTGTCAGAATTT 

'TrCACTATTTCTATTTCTGTCTTrrCCGAAAGCTCTGGTCGAGTCACGAATGGGAAGGTCACATGCAGAA 
AGAAAACAGGAGATTGCCAATTTAGAGGAAATCTGCGGTGCAAGTTTGTCTGCGGGGCAGAAArrGGACTTGGTT 
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CTCTG GGCT CACATCAG TCAAC AACCTCTTCGGCTCAATGCTTGTCACTTTCAAGTCATAGA^ 
GGGTATTTrGACAATATCrrTTATCTTCGTrrGCTGAGAAAGGTTCCGTC Ll Mil 1 GCTGGGCAACATATTCCACT 
AGGAGTTGAGG ATGGTG AGATGATGATATTCTTCTCl 1 1 1 CTCCTATCTCATCGCATTCTTCCTCTTCATACTATGG 
AGTATATTCTTGGTTTTCGAGGCCAGTrGGCAGATTTACTGACGCAATTGATTCAAGAAATGAA 
5 TATTGGGGGATTATACAGAGGACCATGTCACCTATGAACTCAGTCACCTTTGTGCTCAAGTCT 
CTATATriTACAGGATCGCrACAAGTACCAGTTAGAGAATCGTC>.TCCATAT^ 
GAGACAGCAGAGGAGATTTTTCATGCTCTACCTGCTTTTCAACAGGGGACAGATrrAGATAAGAA 
AATG GCTC CAGTTA ATCGAATATATGG CTGAAA ACGGTGGCCAGCATATGCGGATTGGTCTGGATTTGACATCTGC 
TTTTCTTGTCTTTTCAAGGATGGCAGCCATTTTGAAA 
10 ATGACCCTAGTCGGCATTATGATTTGCTGGTTACCAATAACCCGATTCATAAGAAGGAACAGACACCACTCTATTA 
TTTAAAAAATGACTTGGATATGGAGGATTTGGTAGCGATTCGCCj\GTTATTATTCACTTAA 

4106.7 

ATGCAArrTTCAAAGAAAACACGTCAATTGTCAATTAAAAAAATGCAGGAACGTACCCTGGACCTC^^ 
15 GGTGG AGGA ATCACAGGAGCTGCTGTAGCCrTGCAGGCGGCAGCrAGCCGTCTTGAGACTGGTTTGATTGAAATC 
CAAGACTTTGCAGAAGGAACATCTAGTCGTTCAj^CAAAATTGGTTCACGGAGGACTTCGTTACCT 
ACGTAGAAGTGGTCTCAGATACGGTTTCTGAACGTGCAGTGGTTCAACAAATCCCTCCACACATTCCA^ 
ATCCAATGCTCTTACCAGTTTACGATGAAGATGGAGCAACCTTTAGCCTCTTCCGTCTrAAAGT^ 
GTACGACCTCTTGGCAGGTGTTAGCAACACACCAGCTGCGAAC^\AGGTTTrGAGCAAGGATC^ 
20 CCAGCCAAACTTGAAGAAGGAAGGCTTGGTAGGAGGTGGAGTGTATCTTGACTTCCGTAACAACGATGCGCCTCT 
CGTG A7TGAAAACATCAAACGTGCCAACCAAGACGGTGCCCTCATTGCCAACCACGTGAAGGCAGAAGGCTTCCT 
CTTTGACGAAAGTCGCAAGATTACAGGTGTTGTAGCTCGTGATCTCTTGACAGACCAAGTGTT^ 
CGTCTGGTTATTAATACAACaGGTCCTTGGAGTGATAAAGTACGTAATTTGTCTAj\TAAGGGAACGCAATTCT 
AAATGC GCCC AACTAAGGGAGTTCACTT GGTA GTAGATTCAAGCAAAATCAAGGTTTCACAGCCAGT^ 
25 ACACAGCTTTGGGTGACGGTCGTATGGTCTTTGTTCTCCCACGTGAAAj\CAAGACTTAC^^ 

agactacacaggtgatttggagcatccaaaagtaacrcaagaagatgtagattatctacttggcattgtc^ 

ccgcttcccagaatccaacatcaccattgatgatatcgaaagcagctgggcaggtcttcg 

tccattgattgcagggaacagtccctctgactataatggtggaaataacggtaccatcagtgatgaaagctt^ 

CAACTTG ATTGCG ACTCTTG AATCTTATCTCTCCAAAG AAAAAACACCTG AAG ATGTTG AGTCT 

30 cttgaaagtagcacatctgagaaacatttgcatccatctgcagtttctcgtgggtctagcn^ 

atggtctcttgactcttgct ggtg gtaaaat caca gactaccgtaagatggctgaaggagctatggagcgcgtgg 
ttgacatcctcaaagcagaatttgaccgtagctttaaattgatcaattctaaaac^ 
attgaacccagcaaatgtggattcagaaatcgaagcctttgcgcaacttggagtat^ 
ggaagctcactatctggcaaatctt tacg cttcaaatgcaccgaaagtctttgcacttgct 

35 gcgccaggacrcagcttggcagatactttgtcccttcactatgcaatgcgcaatc 

ACTTCci iui iCGTCGTACCAATCACATGCTCTTTATGCGTGATaGCTTGGATAGTATCGTTGAGCCAATTTT 

gaaatgggacgattctatgactggacagaagaagaaaaagcaacttaccgtgctgatgtcgaagcagctcrcgct 
aacaacgatttagcagaattaaaaaattaa 

40 4106,8 

atgatgaatgaattatttggagaatttctagggactttaatcctgattcttctaggaa^ 
tggttc ttcct aaaaccaagagcaatagctcaggttggattgtgattactatgggttgggggattgcagtt 
tgcagtctttgtatctggcaagctcagtccagcttatttaaacccagctgtgaccatc 
ggtttgccttgggctrccgttttgccttatatcttagcccagttcgcaggggccatgctgggtcagatt^ 
45 gttgcaattcaaacctcactatgaggcagaagaaaatgcaggcaatatcctggcaaccttcagtactggacc^ 

ca tcaa gga tactg tatcaaacttgattagcgaaatccttggaa li 1 1 10 it i 1 ggtcttg acaatcntgcntgg 

GTCTTTACGACTITCAGGCAGGTATCGGAACCTTTGCAGTGGCAACTTTGATTGTCGGTATCGGTCT^ 
TGGG AC AACAGGTTATGCCTTG AACCCAG CTCGTG ACCTTGG ACCTCGTATCATG CACAGCATCTTGCCA^ 
AACAAGCGAGACGGAGACTGGTCTTACGCTTGGATTCCTGTTGTAGGCCCTGTTATCGGAGCAGCCnTGGCAGTG 
jyj CTTGTATTCTCACTTTTCTAG 

4106.10 

ATGAAAAGGA£CTGGAGGAACrCATTCGTGACAAATCTTAATACACCTTT^ 

ATCGTACCGTTTTAGCGCCTATGGCTGGCGTGACCAACTCAGCCrrTCGTACTATCGCAAAGGAGCTCG 
->J ACTCGTTGTAATGGAAATGGTCTCrGACAAGGGAATCCAATACAACAACGAAAAAACCCTGCACATGCTTCATAT 
CGATGAGGGCGAAAACCCTGTCTCTATCCAACrTTTTGGTAGCGATGAAGACAGCCTAGCACGCGC^ 
CATCCAAGAAAACACCAAGACCGATATCGTCGATATCAACATGGGCTGCCCTGTCAACAAAATCGTGAAGAACGA 
AGCTGGTGCTATGTGGCrCAAGGATCCAGACAAGATTTACTCCATCATCAACAAGGTCCAGTCTGTCCTTCATATC 
CCACTTACTGTCAAAATGCGTACCGGCTGGGCGGACCCATCTCTTGCAGTAGAAAATGCTCTCCCrGCTGAAGCT 
CAGGTGTTTCTGCCCTCGCCATGCATGGCCGTACCCGTGAACAAATGTATACTGGCCACGCAGACCITGAGACCCT 
rrACAAGGTTGCCCAAGCrcrrAACCAAGATTCCATTCATCGCCAACGGTGATATCCGTACTGTCC\^ 
GCAACGCATCGAAG AAGTT GGTGCTGACGCAGTCATGATTGGCCGAGCTCCCATGGGAAATCCTTACCTCTTCAA 
CCAAATCAACCATTACTTTGAAACAGGAGAAATCCTACCTGATTTGACaTTGAAGACAA 

CGAACACrrGAAACGATTGATTAACCTCAAAGGAGAAAACGTCGCAGTTCGTGAATTCCGCGGTCTCGCTCCTCA 
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CTATCTCCCrrGGAACATCTGGCGCTCCCAAACTCCGTGGAGCCATTTCCCAAGCCACCACCCTGGCACAGATTGA 
AACCCTCTTGCAATTGGAGAAGGCTTAA 



4107.1 

ATGACAAAGAAGAAAATTGAGCGTATTTCTCTAATACACCGAGAAAj^GATTTTATGGCTCAAGTGG^ 
GAGATAAAGAACAJVCCTAAGTATAGTGTCCTTGAGCGTAAAATGTTTGATGCTGCTAAAAATC^ 
10 CTTATCAAAAATACGCAACTATCAAGCAGATAACAGATATTAGGGTACAAACAAGTGAGGCTGACATTTTAGA 
CTGTAAAAGAGGTTTATGTGTACAATCACATGAATCTTATCGGAGCTTGTCAGCGGATATTATTTATCAGTC^ 

accagcttatgataagttaj\ataagtggtttaatatctattctgatttgtattrragcg^ 
tgggggtatatcatgagatggtaggtatctag 

15 4107.2 

ATGAAAAATTCCAACGAGGCTGAGATGAAATTACTTTATACTGATATTCGGA LI ICl n GACAGAAATTCTAACAA 
GAGAGGCAGAAGAGCTAGTTGCAGCTGGCAAGCGGGTCrTCTACATTGCCCCCAACTCT CrriCrri I GAAAAGGA 
ACGCGCCCTGCTGGAATACTTGTCCCAGCAGGCl ICl 1 1 1 1 CGATTACCCTCACGCGCTTTGCTCAAATGCCTCGC 
TATCTGGTCTTGAATGATTTACCAGCTAAAACrACTCTTGATGATAJCGGTCTTGGGTTGGCCT^ 
20 TGC CGAA CTCGATCCCAAGGACTTGCGTGTTTATGGCGCTATTAAGCAGGATCCTCAATTGATCCAGCAGTTAATT 
GAGCTTTACCATGAG ATGACC AAATCTCAGATGAGTTrTTTGGACTTGGAGAATTTAACAGAT^ 
GCGGATTTACrCTTGATTTTrGAGAAAGTAACAGCCTATCrTAATCAAGGTCAGTTAG 
CCCATTTGATTGAGGCTATTGAGAATGACAAGGTAAGTAGTGATT^ 

TACTCGll 1 riCTGCTGAGGAAGAGCGGCTTGTGGACrrACTTCACGCCAAAGGTGTTGAGATrCTTATCCGGGCT 
25 TATGCTACTTAAGAAAGCCTATACCAGTCCTTTTAGCGAGGGCAATCTCTACCAAGCCAGCGTAAAAT^ 
ATCTGGCtTCTAAATACCAAACGCCTGCTCAGGACTGTTCTCAAACTCATGAGAAGATGGATAGTm 
CTCTCGTTTGTTGGAGTCTTCTTATGACTTTTCAGAACTCGCTTrGGATGTCGATGAG 

CAAATCTGGTCTTGTTTGACGCA AAAG GAGCAGTTCGAGCTAGTAGCCCGTAGTATTCGTCAGAAATTACATGAG 
•2 n AACTC AGACCTGAGCTACAA GCATTTT CGTATTCTCTTGGGGGATGTAGCTTCTTACCAGTTAT^ 
3U 1 I 1 M GACCAGTA TCAG ATTCCTTTTTATCTrGGTAGAAGCGAAGCCATGGCTCATCATCCCTTGACTCAGTT^ 

GAGTCTATTTTAGCTTTAAAACGTrACCGTTTTCGTCAGGAGGATTrGATTAATCT^ 

TGACCTCAGTCAGTCTGATATTGATGCTTTTGAGCAATATATCCGCTATCrrGGTATCAATGGCTTGCCAGCC^ 
AGCAAACCTTCACCAAiVrCCCACCATGGAAAATTTAATC^ 

ACCTCTTGAAACCCTCTTTGCCAGCCGAAAACAAAAGGCTGAAAAACTCCTACAAAAATGGAC^ 
3D AGAAGGAGCTGT GACC AAGCAG TTACAA GATrrCACAACCACTTTGGAAGCTGTAGAACAGGAAAGACAAGCCG 
AAGrTTGGAAGGCTTTCTGCCATGTTTTAGAACAATTTGCGACTGTTTTTGCTG 

GTTCCTAGCCTTGCTCCATTCTGGAATGAGTTTGTCCCAATACCGTACCATTCCAGCAACAGTGGACACT^ 
GTGCAGAGTTACGATTTGATTGCACCATTGACTGCTGACTTTGTCTATGCTATTGGACTAACrCAGGAC>^ 
CAAAAATTTCTCAAAACACCAGTCTTCTGACAGATGAAGAAAGGCAAAACCTAAACCAA^ 
4U GTTCAATTACTGATTGCCAGCAGTGAAAATCTCAA 

GTAAG CAGTT GTTCTTCTCGGCTCCAAGCCTTTTTAACGAAAGTGAAAGTAAGGAATCTGCCT 
GATCCAT TTTGG ATTTAGGCGGAGAGAGAAGAGGATGAATCACAAAGGACTGTCTAAGGAGGATATGGGGTCCTA 
TCACAGTi,! i I iGTCTAGTCTGGTTGCCTATCACCAGCAGGGTCAGATGACCGATACTGAGCAAGATTTGACTTTT 
GTCAAGGTTCTGTCGCGTGTCaTAGGTAAAAAACTAGATCAGCAj\CGTCTGGAAAJ\TCCAGCTATCCCAACCAGT 

40 CCAAGCaGCAAGaCCTTAGCCAAGGACACCTTGCAAGCTCT 

CTATCCTGCC AAAC AGGAGTrrTACCTGTCTACGTCGGGTTTGACAGAGTTTTATCGCAATGAATACAG^ 
CTAC GCTA CGTTTTAGGCTTGCAGGAGGAATTACGTTTGCATCCTGATGCCCGTACTCACGGGAATTTC^ 
GTATCTTTGAACGCGCCTTACAGTTGCCTAj\TGAAGATTCCTTTGACCAACGTCTAGAACA^ 
CAGTCAAGAACGCGAATTTGAAGCTATTTATCAAGAAAGTTTGGAAGCCCAGTTTACCAAGGAAGTTTTG^ 

DU GrrGCACGGACAA CTGGA CATATTCTCCGACACAATCCAGCCATCGAAACCATCAAAGAAGAAGCAAATTTTGGT 
GGAAAAGACCAj\GCCTTTATrCAATTAGACAATGCACGCAGTGTCTTTGTACGAGGCAAGGTCGACCGGATTGAC 
CGTTTGAAAGCTAATGGAGCGATAGGAGTAGTAGACTACAAATCCAGTCTGACTCAGTTCCAGrrrCCTCATTTCT 
TTAATGGGCTCAATTCTCAGTTACCAACCTATCrrGCTGCCCTAAAAAGAGAAGGGGAGCAGAACT^^ 
CATGTACTTGGAAATGGCTGAACCTCTCCAATCTCTGATGGCGGTAAAAAGTCTGGCAGGAGCAGTGGTAGAAGC 

DJ CAGCAAATCTATGAAATACCAAGGGCTCTTCTTGGAAAAAGAAAGCAGTTATTTAGGCGAATTTTAT^ 
CAAGGCTAA^AACTGACAGATGAGGAATTTCAGCTCCTACTGGACTACAATGCCTATC^^ 
GAGAAGATTTTAGCAGGCCG GTTC GCCATCAATCCTTATACTGAAAATGGCAGAAGCATTGCCCCATACGTCCAG 
CAACATCAGGCTATTACACGCTTTGA^GCCAATTACCATCrGGGCCAAGCCCGTTTCCTAGAAAAGTrGG 
C^GATGGCAAGCGTCTGGTCGGAGAAAJ^ACTCAAGCAAGCTTGGCTTGAAAAAATAA 

60 GAGAGGAGTTGAATCGATGA 

4107.3 

ATGAAGCTTATTCCCTTTTTAAGTGAGGAGGAGATTCAAAAACTGCAAGAAGCAGAAGCAAATTCGAGCAAGGAA 
CAGAAGAAAACTCCCGAGCAAATCGAAGCTATCTACACTTCTGCCCAGAATATCCrGGTCTCAGCATra 
00 ■^CTGGAAAGACCrrrGTCATGGCAGAGCGCArrCTGGACCAArrGGCGCGTGGTGTCGAAArrrcn'CAACTC^ 
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TCTCAACCTTTACCGTCAAGGCTGCAACTGAACTTAAAGAACGTTTAGAGAAAAAAATCAGC^ 
AAACAGATGATGTCGACCTCAAACAACACTTGGGTCGCCAGTTGGCAGACCTACCCAACGCTGCCAT TGGAA CCA 
TGGArrCTTTCACACAAAAATTCCTTGGCAAACATGCTTATCTCCTTGATATTGCACCTAAT^ 
AjVCCAAAGCGAGCAACTTATTCTCGAAAA CGAA GTCTTTCATGAGGTCTTTGAAGC^ 
5 AAAGAGACCTTTAGTCATTrGCTGAAAAACTTTCCrCGGCGTGGCAAGGACGAACGGGCT 
TATAAAATCTATGACTTCCTCCAATCCACCAGTAATCCTCAAAAGTGGCTGAGTGAA TC^^ 
AGAAAGCTGATTTTACCAGTGAAAj^GAAAAACTGACCGAGCAAATCAAACAAGCCCTT^ 
TTTTCCGTTACCATCTGGATAACGATGCCAj\GGAGTTTGCAAj\GGCTGCCTATTTAGAAA^ 
GGATGAAArrGGCTCCCTAAATCAGGAGTCCGATACTC>GGCTTATC>iGGCAGTGCTTGCGCGTGTTGTC^ 

10 TCTAAGGAGAAAAACGGTCGAGCTCTGACTAATGCCAGCCCTAAGGCTGATTTGAAGCCCCTGGCTGATG 

Aj^CGAj\GaGAGAAAGACCCAGTTTGCTAj^ACTAGGAC>ATTATCAGACCAGATAGCGAT 

TCTCGACTATCAAGAACGTTATCATGGAGACACTTCGAAACTAGCTAAAACCTTCCAATCTTT^ 

GTAGAGGCTTATCGTCAGAGAAJVACGACAGGAAAATGCCTTCGAATTCCCTGATATCAGCCATTACACCATTGAG 

ATTTTAGAGAATTTCCCACAAGTrCGTGAGTCTTATCACGAGCGCTTCCATGAAGTCATGGTCGATGAGTA 

15 ATACCAACCATATTCAAGAACGGATGCTGGAATTGTTGTCTAATGGCCACAATCGCTTTATGGTGCGAGATATCAA 
GCAATCCATCTATCCTTTCCCTCAGGCAGACCCGCAGATTTTCAATGAGAAATTCCAACGCTATGCGCAAAATC^ 
CAAGAAGGCAGGCTCATTATCCTCAAGGAAAATTTCCGTAGTAGTTCAGAAGTGCTGTCAG CAACC AA 
TTTGAj\CGTCTCATGGACCAAGAGGTCGGCGAAATCAACTATGATAACAACCACCAGC^ 
AACTGACTCCCAATCCAGACAAC::AAGGCAGCATTTCrCCTCTACOA(L\AGGACGATA 

20 GTCAAACAGAAACGAAACTAACAGGCGAAATGCGCTTAGTTATCAAGGAGATTCTGAAACTTCATC 

GTGTTGCCTTTAAGGAAATTGCCCrTCTGACCrCCAGCCGCAGTCGTAATGACCAGATTCTCCTCGCCCTCT 
GTACGGAATTCCTGTCAAAACTGACGGAGAGCAAAACAATTATCTCCAATCCCTAGAAGTGCAAGTCATG CTAGA 
CACTCTTCGTGTCATTCACAATCCCCTGCAAGACTACGCCTTGGTTGCCCTTATGAAGTCTCCAATGT^ 
ATGAGGATGAGCTAGCACGTTTGTCCCTTCAGAAAGCAGAGGATAAAGTCCACGAAAATCTCTATGAGAAACTGG 

25 TCAATGCACAAAAAATCGCAAGTAGTCAAAAAGGCTTGATTCACACAGCTCTAGCTG 

AAAAACTAAAGCAATTCATGGATATCCTAGCl Itl IGCCGCTTGTATGCCAAAACCCACTCTCTCTATGACTTGAT 
TTGGAAGATTTACAACGACCtjl I Ti l ATTATGACTATCTTGGGGCTTTGCCGAATGGTCCTGCTAGGCAGGCCAAT 
CTCTATGCCCTAGCACTGCGTGCTGATCAATTTGAAAAGAGCAATTTCAAAGGTrTGTCGCGTT^ 
TTGACCAAGTCrTAGAAGCCCAGCACGATTTGGCAAGCGTGGCCGTCGCACCGCCAAAAGATGCAGTAGAGCTCA 

30 TGACCATCCACAAGAGTAAAGGGCTGGAGTTTCCTTACGTCTTTATCCTCAATATGGATCAAGATrrC^ 

AGACTCTATGTCAGAAGTCATTCTCAGTCGTCAGAATCGTCTTGGTGTCAAATATATTGCCAAGATGGAGACAGGG 
GCAGTAGAAGACCACTATCCTAAAACCATCAAACTCTCCATTCCTAGTCTGACCTATAGGCAGAACGAAGAGGAA 
TTACAGCTAGCAAGCTATTCTGAGCAGATGCGTTrGCTGTATCTTGCTATGACGCGGGCTGAGAAAAAGCTCT 
TTGTCGGCAAGGGTTCTCGTGAAAAGCTGGAATCCAAGGAATACCCAGCAGCCAAAAATGGGAAj\CTAAATAGC^ 

35 ATACTAGACTGC AAGCA CGGAATTTCCAAGATTGGCTTTGGGCTATCAGTAAAGTGTrrACT 

ACTTTAGTTATCGTTTTATTGGCGAAGATCAGTTGACCAGAGAAGCTATCGGAGAGTTGGAAACCAAGAGTCCTCT 

ccaj\gatagcrccca^gcagacaatcgtcagtcagataccatcaaagaagctctggaaatgctgaaggaggtgga 
agtttataatactcttcaccgcgcagctattgaacttcctagtgttcaaaccccaagtcaaatcaa 

GAACCAGTTATGGATATGGAAGGTGTCGAGATTGCTGGTCAAGGTCAGTCAGTAGGCAAGAAuAATCAGCTTCGAT 
40 TTGCCAGAn 1 1 1 CAj\CCAAAGAAAAGGTAACTGGaGCTGAGATTGGTAGTGCTACTCACGAACTCATGCAGAGA 

ATTGACCTCACCCAGCAACTAACCCTrGCTAGCCT AACA GAAACACTCAAACAAGTTCAAACTAGCCAAGCT 
AGAGACAAGATCAATCTTGATAAAAI'ICI IGCl I icitigacacagtactcggtcaggaaattcttgctaataccg 
accatctttatcgcgagcaacctttctccatgctcaaacgagaccaaaagagtcaggaagactttc^ 
tatccttgatggctatctgctttacgaaaacaaaattgttctgttcgactacaagacagaccgct 
45 agtcaactcgtagaccg ctatc gtggtcagttagctctatacgaagaggctttatcacgagcctattcgattgaaa 

ATATTGAAAAATACTTGATTTTACTCGGTAj^j\GACGACGTTCAAGTTGTAAAAGTATAA 
4109.1 

ATGGAACTTGCTCGCCATGCTGAAAGCTTGGGAGTAGATGCTATTGCAACGATTCCACCAATTTAT^ 
50 CAGAATACTCAGTTGCCAAATACTGGAACGATATCAGTTCTGCAGCTCCAAACACAGACTACGTGATTTACAACA 
TTCCTCAATTGGCAGGGGTTGCnTrGACTCCAAGCCTTTACACAGAAATGTTGAAAAArCCTCGTGTTA T^ 
GAj^GAACTCTTCTATGCCAGTTCAAGATATCCAAACCTTTGTCAGCCTTGGTGGAGAAGACCATATC 
GGTCCrGATGAGCAGTTCCTAGGAGGACGCCTCATGCGGGCTAGGGCTGGTATCGGTGGTACrrATGGTGCTATG 
CAGAACTCTTCTTGAjflu^CTCAATCAGTTGATTCCGGATAAGGACCTAGAAACACCGCGTGAATTGCAGTATGCT 
55 CAACGCAATCATTGGTAAACTCACTTCTGCTCATCGAAATATGTACGGTGTCATCAAAGAAGTCTTGAA^ 

GAAGGCrTGAATATTGGATCrCTTCGTTCACCATTGACACCAGTGACTGAAGAAGATCGTCCAGTTGTAGAAGCG 
GCTGCTGCCTTGATTCGTGAAACCAAGGAGCGCTTCCTCTAA 



60 ATGTATAAGACAAAGTG TTTACG AGAGAAGTTAGTATTA i I ' l IT AAAAA rri ' i LTl CCCAATCCTGATCTACCAAT 

TTGCCAATTATTCTGCCTCi 1 M GTTGATAC TGCAATGACAGGTCAATACAACACTATGGACTTGGCTGGTGTATCT 
ATGGCAACCAGTATCTGG AATCC 11 IClU'l ACATTTCTAACaGGGATTGTGTCAGCCTTGGTGCCTATCATTGGTCA 
CCATCTTGGTCGAGGCAAAAA GGAAG AAGTTGCGTCTGA I I 1 ri ACCAATrrATTTATTTGGCCTTGGGCCTATCT 
GTGGTCTTGCTGGGGAT GGTA CT TTrCT TGGCACCAATAATCTTGAATCATATTGGGTTAGAAGCA^ 

65 CAGTAGCGGrrCGCTATCTTTGGTTTTTATCTATCGGGATTATCCCCTTGTTGCrCTTTAG^ 




TGGATTCGCTCGGCTTGACCAAACTCTCCATCTACCrCATGLl 1 I iCTTACTCCCTCTCAATAGCGGATTTAACTAT 
CTCTTGATTrACGGTGCCTTTGGTGTTCCAGAACTGGGAGGGGCTGGTGCTGGT^ 

GGGTCITGCTTGGGATTTCTGTTCTG Oi 1 1 f ATTTAAACAGCAGAAGCTCAAAGCCTTACACCTTGAGAAACGAAT 
TCCACTTAATATGGATAAAATTAAGGAAGGAGTTCGTTTAGGTCTGCCTATTGGGGGAACTGTC^ 
5 GCTAT Cn i i CAGTGGTTCGCTTGATTATGGCTAA Gl I ' Tl CCCCCTTGATTATAGCTAGTCACCAGTCAGCTATGAA 

CrrTTCAAGTCTTATGTACGCCITTCCTATGAGTATCTCATCGGCTATG^ 

CCAAGCGATTTGATGATGCGAAAACCTATATTGGTCTAGGAAGATGGACTGCCCTCAl 1 1 1 1 GCGGCCTTCACCTT 
AACCTTCCTTTACATTTrTAGGGGAAATGTGGCCAGTCrrTATGGTAACGACCC^^ 

C GI i 1 1 i I AACTTATAGT Li lULi 1 CCAGTTAGCAGATACCTTTGCGGCGCCGCTTCAGGGAATnTGCGGGGGTA 
10 TAAGGATACAGTTATTC Cl 11 1 1 ACCTTGGTTTGCTTGGTTATTGGGGCGTAGCAATCCCTGTGTACGCTATTTGA 

4112.2 

ATGAGTACTTTAGCAAAAATAGAAGCGCTCI I G I 1 IGTAGCGGGTGAAGATGGGATTCGGGTCCGCCACTTAGCT 
GAACTCCTCTCTCTGCCACCGACAGGCATCCAGCAAAGTTTAGGAAAATTAGCCCAGAAGTATGAAAAG G^ 
15 GATTCCAGTTTGGCTTTGATTGAGACAAGTGGTGCTTATAGATTGGTGACCAAGCCTCAATTT^ 

aggaatactctaaggcgcctatcaaccagagcttgtctcgggctgcccttgagaccrrgtccat^ 
acagccgattacgcggatagaaattgatgccatccgtggagttaactcgagtggagccttggcaaagttgcaggc 
ttttgacctgataaaggaagacgggaaaaaggaagtattggggcgccccaacctctatgtgactacggattatt^ 
cctagattacatggggataaaccatttagaagaattaccagtgattgatgagcttgagattcaagcccaagaaag 
20 ccaattatttggtgaaaggatagaagaagatgagaatcaataa 

4113.1 

ATGGATACGATGATTAGTAGA I 1 11 ri CGCCATTTATTTGAAGCCrrAAAAAGTTTGAAACGAAATGGTTGGATGA 

cagtagctgctgtcagttcagtcatgattactttgaccntggtggcaatatttgc^\tctgttat^^ 
25 aaactagctacagatattgaaaataatgtccgtgtagtagtttatatccgaaaggatg tgga agataatagtcag 
acl\attgaaaaagaagg tcaaa ctgttacaaataatgactaccacaaggtatatgattctt^ 
acggttaaaagtgttaccttttcaagtaaagaagaacaatatgaaaaattaaccgagat;^ 
aaaatctttgaaggagatgccaatcctctctatgatgcctatattgtagaggcaaac^ 
actatagccgaagatcctaaaaaaattgaagctgtctcrgagg'rtcaagatggcggtgccaatac agaa agactc 
30 ttcaacttagcttcatttatccgtgtttggggactagggattgctg ll 1 ign aa ltl 1 1 atcgcagtl'm cl igat 
ttcaaataccattcgtattaccattatttcccgcagtcgcgaaattc>aatcatgcg<ntcgt 

AGTTATATCCGTGGACC U Tl L T l U Tl AGAAGGAGCCTTTATCGGTTTATTGGGAGCTATCGCACCATCTGTTTTGGT 

ctttattctttatcaaattgtttaccaatctgtcaacaaatcgttggtagggcaaaatct 
gatttatttagtccgttgatgattgccctactatttgtgattggggttttcattggttcattcggatca 
35 catgccccgattcttgaagatttag 

4117.1 

ATGAAGAAAGTAAGATTTArri I I Tl AGCTCTGCTA riTri CI 1 AGCTAGTCCAGAGGGTGCAATGGCTAGTGATG 
GTACTTGGCAAGGAAAACAGTATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGM I 1 IGATACTCATT 

40 ATCAATCTTGGTTCTATATAAAAGCAGATGCTAACrATGCTGAAAATGAATGGCTAAAGCAAGGTGACGA 

TTACCTCAAATCTGGTGGCTATATGGCCAAATCAGAATGGGTAGAAGACAAGGGAGC CI 1 1 i ATTATCTTGACCAA 
GATGGAAAGATGAAAAGAAATGCTTGGGTACGAACTTCCTATGTTGGTGCAACAGGTGCCAAAGTAATAGAAGAC 
TGGGTCTATGATTCTCAATACGATGCTTGGTTTTATATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTC 
CAAATTAAAGGGAAGGACTATTATTTCAAATCCGGTGGTTATCTACTGACAAGTCAGTGGATTAATCAACCTTA 

45 TGAATGCTAGTGGTGCCAAAGTACAGCAAGGTTGGC 1 IT Tl GACAAACAATACCAATCTTGGTTTTACATCAAAGA 

AAATGGAAACTATGCTGATAAAGAATGGATTTTCGAGAATGGTCACTATTATTATCTAAAATCCGGTGGCT 
GCAGCCAATGAATGGATTrGGGATAAGGAATCTTGGTTTTATCTCAAATTTGATGGGAAAA 
TGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGGTGGTTACATGACAGCC^ 
GGGATAAGGAATCTTGGTTTTATCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCTACGAT^ 

50 GTCAAGCTTGGTACTACTTCAAATCCGGTGGTTACATGACAGCCAATGAATGGATTTGGGATAAGGAATC^ 

TTACCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAj\GCTTGGTACTAC^ 
AAATCTGGTGGCTACATGGCGAAAAATGaGACAGTAGATGGTTATCAGCTTGGAAGCGATCGTAAATGGCTTGGA 

ggaaaaactacaaatgaaaatgctgcntactatcaagtagtgcctgrracagcavatgt^ 
aaaagctttcctatatatcgcaaggtagtctcgtatggctagataaggatagaaaaagtgatgacaagc gct^ 
55 ctattactatttctggtttgtcaggctatatgaaaacagaagatttacaagcgctagatgctagtaag 

CCCTTATTATGaGAGTGATGGCCACCG 1 1 1 i lATCACTATGTGGCTCAGAATGCTAGTATCCCAGTAGCTTCTCAT 

CTTTCTCATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCCTGCAT^ 

CCTTCCTTTTCAAAGATTTAACAGAGGCTACAAAC TACAG TGCrGAAGAATTGGATAAGCT 

CATTAACAATAGCC 1 ri rGGAGAACAAGCGCGCTAClU'l i aacgaagccgaagaacattaccatatca atgctct t 

60 TATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTGGGGAAGAAGTAAAATTGCCAAAGATAAGAATAA'l'ri C 1 1 TG 

GCATTACAGCCTATGATACGACCCCTTACCTTTCTGCTAAGACATTTGATGATGTGGATAAGGGAA'rril AGGTGC 
AACCAAGTCGATTAAGGAAAATTATATCGATAGGGGAAGAACTTTCCTTGGAAACAAGGCTTCTGGTATGA^ 
GGAATATGCTTCaGaCCCTTATTGGGGCGAAAAAATTGCTAGTGTGATGATGAAAATCAATGAGAAGCTAGCT 
CAAAGATTAG 

65 
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4U9.2 

ATGAAAAAAGTATTACAAAAATATTGGGCATGGGCTTTTGTGGTCATCCCCCTCr^ 

ATGTGCCGATGTTTCAACGAGCLI 1 I I ACAGTTTTACCAACTGGACAGGATTGACTTATAACTACAAATTTCTTGG 
CXrAAACAACTrrAj\GCTCCTCTTCATGGATCCAj\AATTCATGAATGCGA^ 
5 GCCATGGTCGTTGGTGAGATTGCACrCGGGATCTTCATTGCGCGTGTCTTGAATTCTAAAATCAAATC 
TCrrCCGTGCTTGGTTCTTCTTCCCAGCTGTTTTATCTCGTTTGACAGTGGC^^ 

TACGGTCTTCCAGCGATTGGAAATGCCCTTCATATTGAA rrri r CCAAACCAGT t 11 I 1 AGGG ACTAAGTGGGG AG 
CAATCTTTGCGGCTGTCTTTCTC Li ICI 1 I GGCAAGGGGTGGCTATGCCCATCATCATCTrCCTAGCTGGTTTGCAA 

tctattccaactgagattacagaggcagcaaggattgatggtgcgactagcaagcaagttttctggaacattga^ 
1 0 ttgc cttacttgctaccaagtgtctctatggtctttatcctagccctaa^ 

CTrTGCCATGACCGGTGGTGGTCCAj\ACAj\TGCCACAACCrCACTTGGGCTCITGGT^ AAA 
AACAACCAATTCGGTTATGCCAATGCCATTGCCGTAATCTTGTTCI ICl l AATTGTAGTGATTTCGATCATCCAATT 
GAGAGTATCTAAGAAATTTGAAATTTAA 

15 4119.3 

ATGATGAAACAAGATGAAAGAAAAGCCCTGATTGGCAAATACATTCTATTGATTCTAGGATCGGTTCTGAr^ 
TGCCGCTCCtTGCTACCCTCTTTAGTTCCTTCAAACCCACTAAGGATATTGTAGATAA'lU 
AACTTCACATGGGACAACrrTAGCCGTCTCTTAGCTGATGCGATTGGAGGCTATTATTGGAACrCTCTCG^ 
CTGTCTTGTCTTTACTTGCAGTAATGATCTTTATCCCTATGGCAGCCTACTCCATCG 

20 AAAAGCCTTTACCATCATGTATACCCTCTTAATCCTCGGAATCrTCGTACCTTTCCAAGTC^ 
CGGTT ATGA TGAGTAAACTCGGTTTGGCTAATACCTTTGGTTTGATCTTGCTCTACTr 

ACCCTCTTTCTCTATGTTGGCTATATCAAJ^TCTCGATTCCAGAA^GTCTGGATGAAGCAGCAGAGATCGAT^ 
CTAATCAATTrACAACCrATTTCCGCATCATCTTCCCAATGATGAAj\CCGATGCATGCGACAACCATGATCATC^ 
TGCCCTTTGGTTCTGGAATGACTTCATGTTGCCACTCCTTGTCTTGAACCGGGATTCCAAAATGTGG 
25 TGTTCCAATACAACTACGCACGCCAATATTTCAj\CGaCTACGGaCCAAGCTTTGCCTCTTACGTGCTCGGCAT^ 
CAGTATCACCATTGTCTATCTL 1 1 C IICCAACGCCATATCATTTCAGGAATGAGCAACGGGGCAGTGAAGTAA 

4119.4 

ATGAAAAGTATTCrrCAGAAAATGGGGGAGCATCCGATGCTGCl ICl ITnCiTAGCTATAGTACTGTTATATCCA 
30 TTCT TGCA CAAAATTGGATGGGTCTTGTGGCTTCAGTAGGAATGTTTCTATrTACT Al 1 11 CI U 1 I GCACTATCAG 

TCGATTTTATCCCATAAATTCTTrCGATTGATnfTTGCAGTTTGTCTTGT^ 

AGTTTAGAACATTTCCAAA TTGTG AAGAAATTTAACTATGCr 1 1 1 ClTl CACCCAATATGCAGGTGTG GCATC AGA 
ACCGGGCAGAAGTGACCTTCmAATCCTAATTATTATGGAATTATTTGTTGTTTCTCT 

CTCT TTACAACGACCAAGTTGAATTGGTTGAAAGTATTCTGTGTGATTGCAGG CI 1 IGl 1 aatctctttggtttgaa 

35 CrrrACTCAAAATCGAACTGCCTTTCCTGCTATTATCGCTGGAGCAATTATCT 

GGAAGGC LTI r i GGCnTAGTATrGGGGTCrrCGCGATTGGTTTGAGTTrCCT Cl 1 1 I CTACTGATrrGGGAGTTCGA 
ATCGGTACTTTAGACTCITCTATGGAAGAACGCATTTCTATCTCGGATGCTGCGATGGC Cl Id 11 l AAGCAAAATC 
cm riGG GGTGAAGGGCCATTGACCTATATGAACTCTTATCCTCGGATACATGCTCCTTATCATGAACATGCCCA 
CAGTCTTTATATTGATACGATTCTGAGTrACGGAATTGTGGGGACTA n ' l ' l 'ATTAG fl ' l ' l GTCTTCTGTTGCTCCTG 

40 TTCCCTTGATGATGGAT ATGAG TCAGGAGTCGGGGAAACGTCCGATTATCGGCCTTTATCTATCTTTCCTTACAGT 
GGTTGCTCTCCACGGAA rr rri'GACTTGGCTCTCTTCTGGATTCAGTCAGGCTTT Ai I li CI I GCTAGTrATGTGCA 
GTATTCCATTGGAGCATCGAATGTTGGTATCGGACATGACGGATTAA 

4120.1 

45 ATGTCAAAGATGGATCTTCAGAAAATCATTGCACCGATGATGAj\CTTTGTGAATATGCGTGGCATTATAGCTCTAA 
AAGATCGGATGTTAGCAATTTTGCCA TTGAC AGTAGTTGCTAGTTTGTTCTTGATTATGGGACAATTGCCGTTCG^ 
AGG ATTAAATAAG AGCATTGCTAO 1 0 1 1" 1 ' 1 I GG ACCTAATTGG AC AG AGCCGTTTATGC AAGTATATTCAGG AACT 
TTTGCTATTATGGGTCTAAl I'lCriGl 1 li 1 C AATrG CCTATTCTTATGCTAAGAATAGCGGAGTAGAGGCTTTACC 
AGCTGGAGTTCTATCrGTATCTGCATTCTTTATTTTGCTAAGATCATCITATATCCCTAA^ 

50 GGGACGCTATT AGTAA AGTTTGGTTTGGAGGCCAAGGAATTATCGGTGCTATCATTATAGGTTTGGTAGTAGGAAG 
TAT TTATACCl 1 CUl 1 ATAAAGAGAAAAATTGTTATTAAGATGCCAGAACAAGTTCCACAACCTATTGCCAAACAG 
TTTG AAGCAATG ATTCCAGCATTTGT AA T 1 1 1 C 1 I ATCTTCTATG ATTGTATATATTTTAGCG AAGTCATTGACTAA 
TGGCGGAACATTCATA GAAAT GATTTATrCTGCTATTCAAGTTCCGTTGCAAGGTTTAACTGGATCrrrCTATGCT 
GCTATTGGAATTCCATTCTTTATATCATTTTTGTGGTGGTT^ 

55 GACAGCTCTGCTTTTATCTAA TCTTG ATGCTAA TAAAG CTATGTTAGCCTCTGCTAATCTATCAT^ 
GCACATATTGTTACTCAACAATTTTTAGATTCATTTTTAATT^ 

TGCCATGCTTTTTGCAGCAAAATCAAj^ACAATACCAAGCCTTAGGAAAACTTGCAGCT^^ 

gtaaatgagccagttgtattrggatttccgattgtcatgaatccagttatgtttgtac cul 1 catt crib 1 1 cctgt 
acttgcacctgtg atag tatatgg agcta ttgcaacagcrrrcatgcagccattctcaggggtaacattgcc^ 
60 agtacaccagc tattttatc agga ttttt ggtgggtggatggcaaggagttattactcagctggtgatattagcga 
tgtctacattggtttattrtccattctttaaj\gtacaggatcgtttagcttacca>^ 

4121.2 

ATGAAGAAAAAGGACTTAGTAGACCAACTACTCTCAGAGATCGAGACGGGGAAAGTCAGGACACTGGGAATATA 
65 CGGTCATGGAGCTTCAGGTAAATCAACCTTTGCACAGGAATTGTACCAAGCTTTAGATTCTACTACAGTAAATT^ 
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CTAGAGACAGATCCTTATATCACCTCAGGACGCCATCTCGTAGTACCCAAGGACGCGCCCAATCAAAAGGTGACA 
GCCAGTCTGCCAGTGGCGCATGAACTGGAGACTTTCCAGAGAGATATCCTTgCTTGCAGGCGGGTATGGATCTCrr 
GA 

5 

4122,1 

ATGAAGAAAAGATACCTAGTCTTGACAGCTTTGCTAGCCTTGAGTCTAGCAG Cl 1 OT TCACAAGAAAAAACAAAA 
AATGAAGATGGAGAAACTAAGACAGAACAGACAGCCAAAGCTGATGGAACAGTCGGTAGTAAGTCTCAAGGAGC 
TGCCCAGAAGAAACCAGAAGTGGTCAATAAAGGTGATTACTACAGCATTCAAGGGAAATACGATGAAATCA 

10 AGCCAACAAACACTATCCATTGTCTAAAGACTATAATCCAGGGGAAAATCCAACAGCCAAGGCAGAGTTGGTC^ 
ACTCATCAAAGCGATGCAAGAGGCAGGTTTCCCTATTAGTGATCATTACAGTG Oi 1 1 1 AGAAGTTATGAAACTCAG 
ACCAAGCTCTATCAAGATTATGT CAAC CAAGATGGAAAGGCAGCAGCTGACCGTTACTCTGCCCCTCCTGGCTAT 
AGCGAACACCAGaCAGGCTTGGCCTTTGATG TGATT GGGACTGATCGTGATTTGGTGACAGAAGAAAAAGCAGCC 
CAATGGCTCTTGGATCATGCAGCTGATTATGGCI 1 lOllGTCCGTTATCTCAAAGGCAAGGAAAAGGAAACAGGCT 

15 ATATGGCTGAAGAATGGCACCTGCGTTATGTAGGAAAAGAAGCTAAAGAAATTGCTGCAAGTGGT 
AAGAATACTATGGCTTTGAAGGCGGAGACTACGTCGATTAA 

4125.6 

atgcgtaaattcttaattattttgttgct acca ag i mm igaccatttcaaaagtcgttagcacagaaaaagaag 
20 tcgtctatacttcgaaagaaatttattacctttcacaatctgactttggtatttatrr^ 

ccca tggt ttatggagaggttcctctttatgcgaatgaagatttagtagtggaatctgggaaattgactcccaaaa 

caagttttcaaataaccgactggcgcttaa-ataaacaaggaattccagtat^ 
. agct gcggacaaacgatttrtatatgatcaatcagaggtaactccaacaataaaaaaagtatggttagaatct^ 

ctrraaactgtacaatagtccttatgatttaaaagaagtgaaatcatccttatcagcrratt 
25 gacaagaccatgtttgtagaaggaagagaatttctacatattgatcaggctggatgggtagctaaagaatcaac^ 

tctgaagaagataatcggatgagtaaagttcaagaaatgttatctgaaaaatatcagaaagattctitctct^ 

ATGTTAAGCAACTGACTACTGGAAAAGAAGCTGGTATCAATCAAGATGAAAAGATGTATGCAGCCAGCGTTTTG^ 

aactcrcttatctctattatacgcaagaaaaaataaat^ 

atctgcag tcaa tgattttccaggttcttataaaccagagggaagtggtagtcttcctaaaaa^ 
30 agaatattctttaaagcatttaattacgaaagtatcaaaagaatctgataatgtagctcataatctattggga 

tacatttcaaaccaatctgatgccacattcaaatccaagatgtctgccattatgggagatgattgggatccaaaag 

AAAAArrGATTTCrrCTAAGATGGCCGGGAAGTTrATGGAAGCTATTTAT 

tttgactaaaacagattttgatagtcagcgaattgccaaaggtgtttctgttaaagtag 
c tcat aaaattggagatgcggatgaatttaagcatgatacgggtgttgtctatgcagattctccatttattcm 
35 attttcactaagaattctgattatgatacgatttctaagatagccaaggatgtttatgaggttctaaaatc 

4125.7 

ATGAAAAAACAAAATAATGGTTTAATTAAAAATC C i i r 1 CTATGGTTATTATTTAT Ci rri 1 ' CCTTGTGACAGGATT 
CCAGTATTTCTATTCTGGGAATAACTCAGGAGGAAGTCAGCAAATCAACTATACTGAGTTGGTACAAGAAATTAC 
40 CGATGGTAATGTAAAAGAATTAACTTACCAACCAAATGGTAGTGTTATCGAAGTTTCrGGTGTCTATAA^ 

AAAACAAGTAAAGAAGAAACAGGTATTCAGTTTTTCACGCCATCTGTTACTAAGGTAGAGAAATTTACCAGC^ 
ATTCTTCCTGCAGATACTACCGTATCAGAATTGCAAAj\ACTTGCTACTGACCATAAAGCAGAAGTAA^ 

atg aaagttcaagtggtatatgg att aatctactcgt atccattgtcccatttgg aattctatt c 1 ' i 1 1 1 cctattc 
tctatgatgggaaatatgggaggaggcaatggccgtaatccaatgagttttggacgtagtaaggctaaagcagca 
45 aataaagaagatattaaagtaagattttcagatgttgctggagctgaggaagaaaaacaagaactagttg^ 

GTTGaGTTCTTAAAAGAT CCAAA ACGATTCACAAAACTTGGAGCCCGTATTCCAGCAGGTG I ICi I 1 i GGAGGGAC 

ctccggggacaggtaaaactttgcttgctaaggcagtcgctggagaagcaggtgttccat^ 
ttctgacrittgtagaaatgtttgtcggagt^ 

gcaccagctatcatctttatcgatga aattga tgctgttggacgtcaacgtggagtcggtctcggcggaggtaatg 
50 acgaacgtgaacaaaccttgaaccaacttttgattgagatggatggttttgagggaaatgaagggattatcgtca 
tcgctgcgacaaaccgttcagatgtacttcaccctgcccttttgcgtccaggacgttttgatagaaa^ 
tggt cgtcctgatgttaaaggtcgtgaagcaatcttgaaagttcacgctaagaataagc 
ctttagcagaagat gttg atttgaaattagtggctcaacaaactccaggc l 1 lu r iggtgctgatttagagaatgt 
cttgaatgaagcagctttagttgcrgctcgtcgcaataaatcgataattgatgcttcagatattgatg^ 
5d gatagagttattgctggaccttctaagaaagataagacagtttcacaaaaagaacgagaattggttgcttaccat 
gaggcaggacataccattgttggtctagtcttgtcgaatgctcgcgttgtccataaggttacaattgtaccacgcg 
gccgtgcaggcggatacatgattgcacttcctaaagacgatcaaatgcttctatctaaagaagatatgaaagagc 
a attg gctggcttaatgggtggacgtgtagctgaagaaattatctttaatgtccaaaccacaggagcttca 
actttgaacaagcgacacaaatggcacgtgcaatggttacagagtacggtatgagtgaaaaacttggcccagtac 
50 aatatgaaggaaaccatgctatgcttggtgcacagagtcctcaaaaatcaatttcagaacaaacagcttatcaaa 
ttgatgaagaggttccttcattartaaatgaggcacgaaataaagctgctgaaattattcagtcaaatcgtgaaac 
tcacaagtraattgcagaagcattattgaaatacgaaacattggatagtacacaaattaaagctctttacgaaac 
aggaaagatgcctgaagcagtagaagaggaatctcatgcactatcctatgatgaagtaaagtcaaaaatgaatga 

CGAAAAATAA 
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4125.10 

ATGAGCGAACCAGATTTTTTAAATCATTTTCTCAAGAAGCGATATTTCA^ 
TTTCTGGTGGATTAGATTCCATGTTTCrATTTAAGCTATTGTCTACTTATCAAAA^ 

CTAGCTCATGTCAATCATAAGCAGAGAATTGAATCAGATTGGGAAGAAAAGGAATTAAGGAAGTTGGCTGCTGAA 
5 GCAGAGCTTCCTATTTATATCAGCAATTTTTCAGGAGAATTTTCAGAAGCGCGT 

TTTTTCAAGAGGTCATGAAAAAGACAGGTGCGACAGCTTTAGTCACTGCCCACCATG 

CGA I I I 1 i ATGCGCTTGATTCGAGGAA CTCG CTTGCGCTATCTATCAGGAATTAAGGAGAAGCAAGTAGTCGGAGA 
GATAGAAATCATTCGTC CCl i Ci IGCATTTTCAGAAAAAAGACTTTCCATCAAi i I I 1 CACTTTGAAG AT AC ATCA 
AATCAGGAGAATCATTATTTTCGAAATCGTATrCGAAATTCTTACITACCAGAATTGGAA^ 
1 0 TTAGGG ATGCAATCTTAGGCATTGGCAATG AAATTTTAG AT^ 

TAATGTGGAAGATTTACAGCAGTTA'nriCri ACTCTGAGTCTACACAAAGAOl 1 1 1 ACTTCAAACTTATCTG AATC 
GTTTTCCAGATTrGAATCTTACAAAAGCTCAGTTTGCTGAAGTTCAGCAGATT^ 

TCATCCGATTAAAAATGGCTATGAATTGATAAAAGAGTACCAACAOl I ICAGAl l l'G I'AAAATCAGTCCGCAGGCT 
GATGAAAAGGAAGATGAACITGTGTTACACTATCAAAATCAGCTAGCTTATCAAGGATATTTAI 1 1 ICl 1 I IGGAC 
1 5 TTCCATTA GAAG GTGAATTAATTCAACAAATACCTGTTTCACGTGAAACATCCATACACATTCCT 

AGGAGATGTTTTGATTAAAAATGGCCATAGAAAAAAACTCAGACGTTTATTTATTG 

ATTTGAAAATCCCTATGGAAAAGAGAAACTCrGCTCTTATTATTGAGCAATTTGCT 

AATTGCGACCAATAATTTGAGTAAAAAAACGAAAAATGATATAATGAACACTGTACTTTATATAGAA^^ 
TAGGTAA 

20 

4126.1 . 

ATGAAGCGTTCTTCTCTTTTAGTTAGAATGGTTATTTCCATCTTTCT^ 

ACTTTCT ACT ATC AATC AAO i 1 L H CAGCCATTGAGGCCACCATTGAGGGCAACAGCCAAACGACCATCAGCCAG 

acragccactttattcagtcttatatcaaaaaacragaaaccacctcgacrggtttgacccagcagacggatgttc 
25 tggcctatgctgagaatcccagtcaagacaaggtcgagggaatccgagatttgtttttgaccatcttgaagtcaga 
taaggacttgaaaactgttgtgctggtgaccaaatctggtcaggtcatttctacagatgacagtctgcagatgaa 

AACTTCCTCTGATATGATGGCTGAGGATTGGTACCAAAAGGCCATTCATCAGGGAGCTATGCCT CjI 1 i I GACTCCA 
GCTCGTAAATCAGATAGTCAGTGGGTCATTTCTGTCACTCAAGAACri G 1 iGATGCAAAGGGAGCCAATCTTGGTG 
TGCTTCGTTTGG ATA I'l l C 1 I ATGAAACTCTGG AAGCCTATCTCAATCAACTCCAGTTGGGGCAGCAGGGCTTTGC 

30 CTTCATTATCAATGAAAACCATGAATTTGTCTACCATCCTCAACACACACTTTATaGTTCGTCTAGCAAAATGGAG 
GCTATGAAACCCTACATCGATACAGGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATT 
GCAGGAACTGATTGGACGGTGCTTGGCGTGTCATCATTGGAAAAGTTAGACCAGGTTCGGAGTCAGCTCTTGTGG 
AC CTTG CTTGGGGCCAGTGTCACATCTCr I CI rGTCTGTCTCTGCTTAGTGTGGTTCAGTCTTAAACGCTGG ATTGC 
TCCrinGAAGGATTTGAGAGAAACCATGTTGGAAATTGCTTCTGGTGCTCAAAATCTTCGTGCCAAGGAAGTTGGT 

35 GCCTATGAACTGAGAGAAGTAACrCGCCAATTTAATGCTATGTTGGATCAGATTGATCAGTTGATGGTAGCTATTC 
GT AGCC AGGAAGAAACGACCCGTCAGTACCAACrrCAAGCCCTTrCGAGCCAGATTAATCCACATTTCCTCTATAA 
CAC I ■ 1 ' 1 GG ACACCATC ATCTGG ATGGCTG AATTTCATG AT AGTCAG CG AGTGGTGC AGGTG ACCAAGTCCTTGGCA 
ACCTATTTCCGCrrGGCGCTCAATCAAGGCAAGGACTTGATTTGTCrCTCTGACGAAATCAATCATGTCCGCCAGT 
ATCTCTTTATCCAGAAACAACGCTATGGAGATAAGCTGGAATACGAAATTAATGAAAATGTTGCCITTGATAAT^ 

40 AGTCTTACCCAAGCTGGTCCTACAACCCCITGTAGAAAATGCTCrrTACCATGGCATTAAGGAAAAGGAAGGTCA 

gggccatattaaactttctgtccagaaacaggattcgggattggtcatccgtattgaggatgatggcgttggcttc 
caagatgctggtgatagtagtcaaagtcaactcaaacgtgggggagttcgtcttcaaaatgtcgatcaacggctc 
aaacttcattttggagccaattaccatatgaagattgattctacaccccaaaaagggacgaaagttgaaatatat 
at aaat ac aat ag aaa ct ag ct aa 

45 

4126.7 - 

A TGAA GCGTTCTTCrCTTTTAGTTAGAATGGTTATTTCCATCTTTCTGGTCTTTCTCATTCTCCr 
ACTTTCTA CTATC AATCAAGrrCi'l'CAGCCATTGAGGCCACCATTCAGGGCAACAGCCAAACGACCATCAGCCAG 

50 ACTAGCCACrrTATTCAGTCTTATATCAAAAAACTAGAAACCACCTCGACrGGTTTGACCCAGCAGACGGATGTTC 
TGGCCTATGCTGAGAATCCCAGTCAAGACAAGGTCGAGGGAATCCGAGATrrGTTTTTGACCATCTTGAAGTCAGA 
TAAGCACTTGAAAACTGTTGTGCTGGTGACCAAATCTGGTCAGGTCATrrCTACAGATGACAGTGTGCAGATGAA 
AACTTCCTCTGATATGATGGCrGAGGATTGGTACCAAAAGGCCATTCATCAGGGAGCTATGCCTGTrTTGACTCCA 
GCTCGT AAAT CAGATAGTCAGTGGGTCATTTCTGTCACTCAAGAAC'l l Ul I GATGCAAAGGGAGCCAATCTTGGTG 

55 TGCTTCGTTTGGATAl 1 ICl i ATGAAACTCTGG AAGCCTATCTCAATCAACTCCAGTTGGGGCAGCAGGGCTTTGC 

cttcattatcaatgaaaaccatgaatrrgtctaccatcctcaacacacagtrratagttcgtctagcaaaatggag 
gctatgaaaccctacatcgatacaggtcagggttatactcctggtcacaaatcctacgtcagtcaagagaagatt 
gcaggaactgattggacggtgcttggcgtgtcatcattggaaaagttagaccaggttcggagtcagctcttgtgg 
ac cttg cttggggccagtgtcacatctcttcttgtctgtctctgcttagtgtggttcagtcttaaacgctggattgc 

60 TCCTTTGAAGGATTTGAGAGAAACCATGTTGGAAATTGCTTCTGGTGCTCAAAATCTTCGTGCCAAGGAAGT^ 

GCCTATGAACTGAGAGAAGTAACTCGCCAArTTAATGCTATGTTGGATCAGATTGATCAGTTGATGGTACCTATTC 
GT AGCC AGGAAGAAACGACCCGTCAGTACCAACTTCAAGCCCTTTCGAGCCAGATTAATCCACATITCCTCTATAA 

cactttggacaccatcatctggatggctgaatttcatgatagtcagcgagtggtgcaggtgaccaagtccttggca 
acct attt ccccttcgcgctcaatcaaggcaaggacttgatttgtctctctgacgaaatcaatcatgtccgccact 
65 atctctttatccagaaacaacgctatggagataagctggaatacgaaattaatgaaaatgttgcctttgataattt 
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AGTCTTACCCAAGCTGGTCCTACAACCCCTTCrrAGAAAATGCTCTTTACCATGGCATTAAGG^^ 
GGGCCATArrAAACTTTCXGTCCAGAAACAGGATTCGGGATTGGTCATCCGTATTGAGGATGATGGCGTTGGCTTC 
CAAGATG CTCGT GATAGTACTCAAAGTCAACTCAAACGTGGGGGAGTTGGTCTTCAAAATGTCGATCAACGGCT^ 
AAACTTC>TTTTGGAGCCAATTACCATATGAAGATTGATTCTAGACCCCAAAAAGGGACGAAAGTT 
5 ATAAATAGAATAGAAACTAGCTAA 

4127^4 

ATOri r iWTA AATTATTAAGAGAAGCTCrrAAAGTCAAGCAGGTTCGATCAAAAATTTTATT^ I iATCGT 

TTTGGTCTTTCGTATCCGAACTAGCATrACAGTTCCTGCTGTGAATGCCAATAGCTTGAATGCT^ 
1 0 C Ci i C i I AAACATGTTGAGCTTGGTGTCGGGGAATGCCCTAAAAAA C rri ' I CGA ITITI GCCCTAGGACTTAGTCC 

CTATATCACCGCTTCTATTGTTCTCCAACTCTTGCAAATGGATATTTTACCCAAGTT^ 

GGGGAAGTAGGT CGAAG AAAATTGAATCAAGCTACTCGTTATATTGCTCTAGTTCTCG LI i 1 i GTG CAATC TATCG 
CGATTACAGCTGGTTTTAATACCTTGGCTGGAGCrCAATTGATTAAAACTGCTTTAACTCCAO 
GATTGGTATCATCTTAACAGCTGGTAGTATGATTCTCACTTCGTTGGGTGAGCAAATTACAGATAAGGGATACGGA 
15 AACGGTGTTTCCATGATrATCTTTGCCGGGATTGrrTCCTCAATTCCAGAGATGATTCAGGGCATCT 
ACTTrGTG AACG TCCCAAGTAGCCGTATCACTTCATCTATCATTITCGTAATCATTTTGAT^ 
ATTATTTACTTTACAACrrATGTTCAACAACCAGAATACAAAATrCCAATCCAATATACTAAGGT^ 
CTCCATCTAGCTCTTACCTTCCGTTAAAAGTAAACCCTGCTCGAGTTATCCCTGTrATCT^ 

GCAGCGCCTGCGGCTATTCTTCA O I I U i GAGTGCCACAGGTCATGATTCGGCTTGGGTAAGGGTAGCACAAGAGA 
20 TGTTGGCAACTACTTCTCCAACTGGTATTGCCATGTATGLl 1 IGl 1 GATTATTCTCTTTACATTCTTCTATACGTrTG 

TACAGATTAATCCTGAAAAAGCAGCAGAGACCTACAAAAGAGTGGTGCCTATATCCATGGAGTTCGTCCTGCTAA 
AGGTACAGAAGAATATATGTCTAAA L ' 1 ' 1 C 1 1 CGTCGTCTTGCAACTGTTGGTTCCCTCTTCCTTGGTGTGA 



25 4127.5 

ATGGATATTAGACAAGTTACTGAAACCATCGCC^\TGATTGAGGAGCAAAACTTCGATATTAGAACCATTACCATG 
GGGATTTCTCli 1 I GGACTGTATCGATCCAGaTATCAATCGTGCTGCGGAGAAAATCTATCAAAAAATTACGACAA 

aggcggctaattragtagctgttggtgatgaaattgcggctgagttgggaattcctatcgttaataagcgtgtatc 
ggtgacacctatttctctgattggg gcag cgacaga tgcga cggactacgtcgttctggcaaaagcgcttgataa 
30 ggcrgcgaaagagattggtgtggactttattggtcoi i i i ictgccttagtacaaaaaggttatcaaaagggagat 
gagattctcatcaattccattcctcgcgctttggctcagacggataaggtctgctcgtcagtcaatatcggcto^ 
ccaagtctggtattaatatgacggctgtggcagatatcggacgaattatcaaggaaacagcaaatctrrcagata 
tgggagtcgccaagttcgttgtattccctaatgctgttgaggacaatccatttatggcgggtgccrr^ 

TGGGGAAGCAGATGTTATCATCAATGTCGGAGTTTCTGGTCCTGGTGTTGTGAA^ 
35 GGACAGAGCTrrGATGTACTAGCCGAAACAGTTAAGAAAACTGCCTTTAAAATCACTCGTATCGGTC^ 

GTCAAATGGCCAGTGAGAGACTGGGTGTGGAGTTTGGTATTGTGGACTTGAGTTTGGCACCAACCCCTGCG^^ 

agactctgtggcacgtgtccttgaggaaatggggctagaaacagttggcacgcatggaacgacggctg ccttg gc 
ccrcttgaacgaccaagttaaaaagggtggagtgatggcctgcaaccaagtcggtggtttatctggtgcctt^ 
cctgtttctgaggatgaaggaatgattgctgcagtgcaaaatggctctcttaatttagaaaa^ 
40 cggctatctgttctgttggattggatatgattgccatcccagaagatacccctgctgaaactattgcggctatgat 
tgcggatgaagcaccaatcggtgttatcaacatgaaaacaacagctgttcgtatcattcccaaaggaaaagaagg 
cgatatgattgactttggtggtctattaggaactgcac ccgt tatgaagcttaatggggcrrcgtctgtcgac^ 
atctctcgcggtggacaaatcccagcaccaattcatagttttaaaaattaa 

45 4128.1 

atgacacagattattgatgggaaagctttagcggccaaattgcaggggcagttggctcaaaagactgcaaaatta 
aaggaagaaacaggtctagtgcctggtttggtagtgattttggttggggacaatccagccagccaagtctacgt^ 
cgcaacaaggagaggtcagcccttgcggctggtttccgtagcgaagtagtacgggtrccagagaccattactcaa 
gaggaattgttagacctgattgctaaatacaatcaggatccagcttggcatgggattttggttcagttgccattac 

50 caaaacacatt gatg aagaggcggttctattggctattgacccagaaaaggatgtggatggtttccatcctctaa 
acatggggcgtctttggtctggtcatccagtcatgattccttcgacaccggcaggaattatggaaatgttccatga 
atat gggat tgacttggaaggtaaaaatgcagtcgtcatcggtcgatccaatattgtcggaaaacctatggccca 
gcttcttttggcaaagaatgcaacagtaaccttgactcactcacgtactcataatctttccaaggtc 
gcagatarrctggttgttgcaatcggtcgtgccaagtttgtgactgctgactttgtcaaaccaggtgcggtagtca 

55 ttgacgttgggatgaaccgcgatgaaaatggtaagcrctgtggggatgttgattatgaggcggttgccccacttgc 
tagccacattacgccagtccctggaggtgtcggtcctatgaccattactatgctgatggagcaaacctatcaggca 

GCACTTAGGACATTGGATAGAAAATAA 
4128.2 

60 ATGTCTAAATTTAATCGTATTCATTTGGTGGTACTGGATTCTGTAGGAATCGGTGCAGCACCAGATGCTAATAACT 
TTGTCAATGCAGGGGTTCCAGATGGAGCrrCTGACACACTGGGACACATTTCAAAAACAGTTGGTTTGAATGTCCC 
AAACATGGCTAAAATAGGTCTTGGAAATATTCCTCGTGAAACTCCTCTTAAGACTGTAGCAGCrGAAAGCAA^^ 
AACTGGATATGCAACAA AATT AGAG GAAG TATCTCrrGGTAAGGATACTATGACTGGACACTGGGAAATCATGGG 
ACTCAACATTACTGAGCCTTTCGATACriTCTGGAACGGATTCCCAGAAGAAATCCTGACAAAAATCGAAG 

65 TCAGGACGCAAGGTTATTCGTGAAGCCAACAAACCTTArrCAGGAACGGCTCTTATCTATGATTTTGGACCACGTC 
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AG ATGG AAACTGGAGACriTGATTATCrATACTTCACCTGACCCTGTTTTGCAGATTGCTCCCCACG^ 
TCCTTTGGATGAATTGTACCGTATCTGTG^AATACGCTCCTTCGATTACCCTTGAGCGTCCTGCC C ' ri CI rGGTCGCA 
T CATTG CTCGCCC rrATG TAGGTGAACCAGGTAACITCACTCGTACGGCAAACCGTCGTGACTrGGCT 
ATT TTTCCCj^ACTGTTTTGGATAAATTGAATGAGGCTGGTATCGATACTTATG 
5 TTTAACGGTGCTGGTATCAACCATGACATGGGTCACAACAAGTCAAATAOT 

actatgggacttgctgagtttgaaaaaggattctcattcacaaacctagttgacrrrgatgccc^^ 
gtcgtaatgctcacggttaccgtgattgcttgcatgagtttgatgaacgcttacctgaaattatcgcagctatgag 
agagaatgaccttctcttgattactgcggaccatggaaatgacccaacgtatgcaggaacggatcacactcgcga 
atatattccattgttggcctatagccctgcctttaaa 
1 0 cagcgactgttgccgataactttggtgtggaaactgctatgattggcgaaag i i i ' cl 1 agataaattggtataa 

41 29.2 

ATGTTTATTTCCATCAGTGCTGGAATTGTGACAl 1 i 11 ACTAACTTTAGTAGAj\ATTCCGGCCTn'ATCCAATTTTA 
TAGAAAGGC GCAA A TTACA GGCCAGCAGATG CATGA GGATGrCAAACAGCATCAGGCAAAAGCTGCGACTCCTA 
15 CAATGGCAGGTTTGG TTTT CTTGATTACTTCTGTTTTGGTTGCI 1 ICl 1 iriCGCCCTATTTAGTACCCAATTCAGCA 

ataatgtgggaatgal 1 1 1 0 11 catcttggtcttgtatggcttggtcggatttttagatgactttctcaag 
cgtaaaatcaatgaggggcttaatcctaag caaaa attagctcttcagcttctaggtggagttatcttctat 
tctat gagcccggtc gcgatat cctgtctgtctttggttatccagttcatttgggatttt^ i cgct 

ctttt ctggc tagtcggtttttcaaacgcagtaaacttgacagacggtgttgacggtttagct 
20 gattag tttg tctgc ctat ggagt tattg cctatgtgcaaggtcagatggatattcttctagtgattcttg^ 

ttggtggtttgctcggl 11 cl'lcatctttaaccataagcctgccaag gi c i 1 1 atgggtgatgtgggaagtttggcc 
ctagg tgggatgctggcagctatctctatggctctccaccaagaatggacrctcttgattatcggaattct 

1 i 11 IGAAACAACTT CTGTT ATGATGCAAGTCAGTTATTTCAAACTGACAGCTGGTAAACCTATTTTCCGTATGAC 
G CCTGTA CATCACCATTTTGAGCTTGGGGGATTGTCTGGTAAAGGAAATCCTTGGAGCGAGTGGAAGGTTC 
25 TTCTTTTpGGGAGTGGGACTTCTAGCAAGTCTCCTGACCCTAGCAATTTTATATTTGATGTAA 

4133.1 

TTGTTTAAGAAAAATA AAG ACATTCTTAATATTGCATTGCCAGCTATGGGTGAAAA Cl I 1 1 1 GCAGATGCTAATGG 
GAATGGTGGACAGTrA TTTGG TTGCrCATTTAGGATTGATAGCTATTTCAGGGGTTTCAGTAGCTGGTAATA 

30 CACCATTTATCAGGCGATTTTCATCGCTCTGGGAGCTGCTArrrCCAGTGTTATTTCAAAAAGCATA 
GACCAGTCGAAGTTGGCCTATCATGTGACTG AGGCG TTGAAGATTACCTTACTATTAAGTITCCTTT^ 
GTCCA TCTT CGCTGGGAAAGAGATGATAGGACTT TTGGG GACGGAGAGGGATGTAGCTGAGAGTGGTGGACTGTA 
TCTATCTTTGGTAGGCGGATCGATT GTTCTCT TAGCTTTAATGACTAGTCTAGGAGCCTTGATTCGT^ 
AATCCACGTCTGCCTCTCTATGTTAOl I Tl i'l ATCCAATGCCTTGAATATT Cl 1111 1 CAAGTCTAGCT Al ' n i lUi 1 

35 CTGGATATGGGGATAGCTGGTGTTGCT TGGGG GACAATTGTGTCTCGTTrGGTTGGTCTTGTGA l ri lUl GGTCA^ 

AATTAAAACTGCCTTATGGGAAGCCAACTTTTGGTTTAGATAAGGAACTGTTGACCTTGGCT^ 
AGAGCGACTTATGATGAGGGCTGGAGATGTAGTGATCATTGCCTTGGTCG 1 1 ICri ' ri GGGACGGAGGCAGTTGCT 
GGGAATGCAATCGGAGAAGTCnTGACCCAGTTTAACTATATGCCTGCCTTTGGCGTCGCTACGGCAACG 
TGTTGGCCCGAGCAGTTGGAGAGGATGATTGGAAAAGAGTTGCTAGTTTGAGTAj^j\CA^ 

40 CTTCCTCATGTTGCCCCTGTCCTTrAGTATATATGTCTTGGGTGTACCATTA^ 

TAGCGGTGGAGGCTAGTGTTCTAGTGACACTGTTTTCACTACTTGGGACCCCTATGACGACACGAACAGTCATCTA 
TACGGCAGTCTGGCAGGGaTTAGGAAATGCACGCCTCCCTTTTTATGCGACAAGTATAGGAATGTGGTGTATCCGC 
ATTGGGACAGGATATCTGATGGGGATTGTGCTTGGTTGGGGCTTGCCTGGTATTTGGG 
ATGGTTrrCGCTGGTTATTrCTACGCTATCGTTACCAGCGCTATATGAGCTTGAAAGGATAG 

45 

4135.2 

ATGCAAACTCAAGAAAAACACTCCCAAGCAGCCGTTCTTGGCrrGCAGCACTTACTAGCCATGTACTCAG 

ATCCTGGTTCCCATCATGATTGCGACAGCCCTTGGCTATTCAGCTGAGCAGTTGACCTACCTGATrrCrACAGAT^ 

TCTTCATGTGTGGGGTGGCAACCTTCCTCCAACTCCAACTCAACAAATACTTTGGGAT^^ 

DU TGGAGTTGCATTCCAGTCGGTCGCTCCCTTGATTATGATTGGCCAAA 

CTTATCGCATCTGGGATTTACGTGGTTCTTGTTrCAGGCATCTTCTCAAAAGTAGCCAATCTC^ 
AACACGATCTGTTATTACCACGATTGGTTTAACCTTGATCCCTGTCGCTATTGGAAATATGGGAAATAACGTTCC^ 
GAGCCAACTGGTCAAAGTCT CTTGC TTGCAGCTATTACTGTTCTGATTATCCTCTTGATCAACATC^ 
GATTTATCAAGTCTATCTCTATrrrGATTGGTCTGGTTGTTGGAACTGCCATTGCTGCTACTATGC 

DD TTCTCTCCTGTTGCGGTAGCTCCACnTGTCCATGTCCCAACTCCACTCTACTTTGGCATGCCAACCT^ 
ATCrATTCTCATGATGTGTATCATCGCAACGGTGTCrATGCTTGAGTCAACTGGTGTTTATCrrc 
TCAC AAAG GATCCAATCGACAGCACGCG CCTTC GCAACGGATACCGCGCAGAAGGTTTGGCCGTACTTCTCGGAG 
GAATCTTTAACACCTTCCCTTACACTOGATTTTCACAAAACGTTGGTTT 
CCTGCCAATCTACTACGCAGCTGGTTTCCTGGTTCTCCTTGGACTGCTTCCTAj\GTTTGGCG 

OU TTCCAAGCT CCGT CCTCGGTGGTGCCATGCTGGTAATGTTTGGTTTTCrrATCAATTCAACGGATGCAAATCCT^ 
CCGTCTTGACTTTGCTAACAATGAACACAACTTCCITATCGCAG 

AACAGTAATCTCTTTGTCACCATGCCGACAGCCTTCCAj\ATGTTCTTCTCAAACGGAATCGTCGTAGCCAGCCTA 
TCGCT ATTGTCCTCAATG CCGTATTAAATCATAAAAAG AAAT AA 

65 4136.2 
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ATGAAAGATAGAATAAAAGAATATTTACAAGACAAGGGAAAGCTGACrGTTAATGATTTGGCTCAGGCT ^ 
AAAGACAGTTCCAAGGATTTTCGTGAGTTGATTAAAACCTTCTCCTTAATGGA^ 
AAGAACATGGTAGTCTGACATTAGAAATTAAGAAAAAACATGAGATTACCCTCAAGCGGATTTT^ 
AAAATGGCTTTGGCrrrCTTAGTCTGGAAGGCGAGGACGACGACCTTTTTGTAGGGAA^ 
5 TATTGATGGTGATACCGTCGAGGTAGTGATTAACAAAGTCGCTGACCGCAATAAGGGAACAGCAGCAGAAGCCAA 
AATTATTGATATCCTAGAACACAGTTTGACAACAGTrGTCGGGCAAATCGTTCTGGATCAGGAAAAA 
GCrGGCTATATTCCTTCAAAAAATCAGAAAATCAGTCAACCGATTTATGTTAAGAAAC^ 

GGAACAGAAGTTCTCAAAGTCTTTATCGATAAATACCCAAGCAAGAAACATGArriCl I 1 GTCGCGAGTGTTCTCG 
ATGTAGTGGG ACaCTCAACGGATGTCGG AATTG ATGT T C i I G AGGTCTTGG AATCAATGG ACATTGTATCCG AGTT 

10 TCCAGAAGCTGTTGTTAAGGAAGCAGAAAGTGTCCCTGATGCTCCGTCTCAAAAGGATATGGAAGGTCGT^ 

TCTAAGAGATGAAATTACCmACCATTGACGCTGCGGATGCCAAGGACTTGGACGATGCAGTGCATATCAAGGC 
TCTGAAAAATGGCAATCTGGAGTTTGGGGTTCACATCGCAGATG I 1 I L I lATrATGTGACCGAGGGGTCTGCCCTT 
GACAAGGAAGCCCTTAACCGTGCGACTTCTGTTTACGTGACAGACCGAGTGGTGCCAATGCTTCCAGAACGACT 
TCAAATGGCATCTGCTCTCTCAATCCCCAAGrrGACCGCCTGACCCAGTCTGCTATTAT 

15 GGAGATTGATAAACATGGTCGTGTGGTCAACTATACCATTACACAAACAGTTATCAACACCAOI 1 1 ICGTATGACC 

TATAGCGATGTCAATGATATCCTAGCTGGCGATGAAGAAAAGAGAAAAGAATATCATAAAA TTGTA TCAAGTATC 
GAACTCATGGCCAAGCTTCATGAAACTTTAGAAAACATGCGTGTGAAACGTGGAGCTCrCAATT^ 
AAGCGAAGATTTTAGTGGATAAACAAGGTAAGCCTGTTGATATCLi i I CrrCGGCAGCGTGGTATT GCCGA GCGGA 
TGATTGAGTLi i J i ATGTTGATGGCTAATGAAACAGTTGCCGAACATTTCj\GCAAGTTGCATTTGCClUU'i ATCT 

20 CGAATrCACGAGGAGCCTAAGGCTGAAAACGTTCAGAAGTTTATTGATrATCCTTCGAGTTTTGGC^ 

ATGGAACTGCCAGTGAGATTAGTCAGGAGGCACTTCAAGACATCATGCGTGCTGTTGAGGGAGAACCTTATGCAG 
ATGTATTCTCCATGATGCTTCTTCGCTCTATGCAGCAGGCTCGTTATTCGGAGCACAATCACGGCCACTATGGACT 
AGCTGCTGACTATTATACTCACTTTACCAGT CCAAT TCGTCCTTATCCAGAC Ci ' l Ci ' l O I f CACCGTATGATTCGGG 
ATTACGGCCGTTCTAAGGAAATAGCAGAGCArrriGAACAAGTGATrCCAGAGATTGCGACCCAGTCTT 

25 GTGAACCTCCTGCCATAGAAGCTGAGCGTGAAGTCGAAGCCATGAAAAAGGCTGAGTATATGGAAGAATACGTGG 
GTGAAGAGTATGATGCAGTTGTATCAA GTATT GTCAAATTCGGTCTCTTTCTCGAATTGCCAAACAC^ 
CTTGATTCACATCACTAATCTGCCTGAATTTTATCATTTCAATGAGCGTGATTTGACTC^ 

GGTATCACTTTCCGAGTGGGTCAGCAGATCCGTATCCGTGTTGAAAGAGCGGATAAAATGACTGGAGAGATTGAT 
TTTTCATTCGTACCTAGTGAGTTTGATGTGATTGAAAAAGGCrrcAAACAGTCTAGTC^ 
30 AGTGGCAGAGGGCGTGATTCAAATCGTCGTTCGGATAAGAAGGAAGACAAGAGAAAATCAGGACGCTCAAATGA 
TAAGCGTAAGCATTCACAAAAAGACAAGAAGAAAAAAGGAAAGAAACCTTTTTACAAGGAAG^ 
GAGCCAAGCATGGCAAAGGGCGAGGGAAAGGTCGTCGCACAAAATAA 

4137.2 

35 ATGGGCACAACAGGATTTACAATAATTGACTTAATTATCITGATTGTTTATTTACTT^ 

CTATTTCTCTAAAAAAGAGATGAAAGGAAAAGAG 1' 1 CI I'l AAAGGAGATGGTTCGGTTCCTTGGTATGTTACTTCG 
GTATCCAl 1 i 1 IGCCACAATGCTCAGTCCGATTrCCI'rcri'GGGACTCGCTGGTAGCTCTTATGCAGGTAGCTGGAT 
TTTATGGTTTGCTCAATTAGGGATGGTAGTaGCTATTCCACTGACAATTC GI 1 i l ATCTTACCTATCrTTGCACGGA 
TAGACATCGATACGGCATATGATTACTTGG ATAAACG ril'l AATTCTAAAGCACTTCGTATTATTTCAGCACTCTT 

40 GTTTATTATTTATCAATTGGGACGTATGTCTATCATTATGTACCTCCCATCAGCTGGTTTATCAGTATTGACAGG^ 

ttgacatcaat atttt gattattttgatgggtgtagtrgcaattgtttattcrr 
ttatggaca gact ttattcaaggtgtgattctgattagtggtgtcgttttagctt^ 

TAAj\GGTGGCTTTGGTGCAGTAGCAGAAACATrAGCAAACGGGAAATTCCTTGCTGCAAAT^ 

TCCTAACTTGCTTTCAAACTCCATLl 1 1 1 i AATTGTGATGGCTTCAGGCTTTACAATCTTGTCTTCCTATGCTTCATC 
45 TCAAGATTTGGTTCAACGTTTTACTACAACACAAAATATTAAGAAACTTAATAAGATGTTGT^ 

ttgtcacttgcaactgcaacagtcttttacntgarrggtacaggctrgtacgtattctatcaact 
atagtgcagctagcaatatccctcaagaccaaatctttatgtactttattgcataccacttaccagtagct 
aggtttgatcttggcagcgatttatgcagcatctcaatcaactatttcaacaggtttgaactctg^ 
tggacattgcatattcaagatgtcatttcraaaaatatgtcagacaatcgtcgtacgaaaattgcacaattcc^ 

50 ctcragcagtaggtttattctcaattggtgtttccattgtcatggctcactcagatattaaat^ 

ttcaatagtrrcatgggacttgtacttggtctacttggtgctgtatttattcttgga i 1 igl 1 i ctaaaaaagcaaa 
taaacaaggtgcttatgcagcgctgattgtatcaaccatcgtcatggtatttattaaatacttcc 
gctgttagcractgggcatattcattgatttcaatctctgtatcagtagtttcaggttatattgtat^ 
tggaaataaagtatctgcacctaaatatacaacgattcatgatattacagaaattaaagcggattcaagttggga 

55 agttcgtcactaa 

4138.1 

ATGAAATTTACTAAAAAATATATACCAGCTGGATCAGCTGTTATCGTATCCrTGAGTCTATCTGCCT 
ACCAGCATCGTTCGCAGGAAAATAAGGACAATAATCGTGTCTCTTATGTGGATGGCAGCCACTCAAGTCAGAAAA 

60 GTGAAAACTTGACACCAGACCAGGTTAGCCAGAAAGAAGGAATTCAGGCTGAGCAAATTGTAATCAAAATTACAG 
ATCACGGCTATGTAACGTCACACGGTGACCACTATCATrACTATAATGGGAAAGTTCCTTATGATGCCCTCTTTA^ 
TGAAGAACTCTTGATGAAGGATCCAAACTATCAACTTAAAGACGCTGATATTGTCAATGAAGTCAAGGGTGGTT 
TATCATCAAGGTCGATGGAAAATATTATGTCTACCTGAAAGATGCAGCTC\TGCTGATAATGTTCGAj\CrAAAGAT 
GAAATCAATCGTCAAAAACAAGAACATGTCAAAGATAATGAGAj\GGTTAACTCTAATCTTCCTGTAGCAj\CCT 

55 CAGGGACGATATACGACAAATGATGGTTATGTCTTTAATCCAGCTGATATTATCGAAGATACGGGTAATGCTTATA 
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TCGTTCCTCATGGAGCTCACTATCACTACATTCCCAAAACCCATTTATCTGCTAGTGAATTAGCAGCAGCTAA^ 
ACATCTGGCTGGAAAAAATATGCAACCGAGTCAGTTAAGCTATTCTTCAACACCT 
TGTAGCAAAAGGATCAACTAGCAAGCCAGCAAATAAATCTGAAAATCTCCAGAGTCr^^ 
TTCACCTAGCGCCCAACGTTACAGTGAATCACATG GCCT GGTCTTTGACCCTGCTAAGATTATCACTCGTA^ 
5 AATGGAGTTGCGATTCCGCATGGCGACCATTACCACTTTATTCCTTACACCAAGCTTTCTrGCC^ 
TTGCCAGAATGCTGCCTATCAGTCGAACTGGTTCTACACTT^ 

TCTAGGCAGTCTTTCAAGCAATCCl lei ICII I AACGACXAGTAAGGAGCTCTCTrCAGCATCTGATCGTTATATTT 
TTAATCCAAAAGATATCGTTGAAGAAACGGCTACAGCTTATATTGTAAGACATGGTGATCATTTCCATTACATTCC 
AAAATCAAATCAAATTGGGCAACCGACTCTTCCAAACAATAGTCTAGCAACACCTTCTCCATCT 
10 CCAG GAACT TCACATGAGAAACATGAAGAAGATGGATACGGATTTGATGCTAATCGTATTATCGCTGAAGATGAA 
TCAGGTTTTGTCATGAGTCACGGAGACCACAATCATTATTTCTTCAAGAAGGACTTGACA 
GTGCGCAAAAACATTTAC 

4139.1 

15 

ATGAAAAAAAGAGCAATAGTGGCAGTCATTGTACTGCTTTTGATTGGGCTGGATCACTTGGTCAAATCCTATATCG 
TCCAGC AGATT CCACTGGCTGAAGTGCGCTCCTGGATCCCCAATTTCGTTAGCTTGACCTACCTGCAAAATCGAGG 
TGCAGCCTTTTCTATCTTACAAGATCAGCAGCTGTTATTCGCTGTCATTACTCTGGTTGTCCTCATA 
GGTATTTACATAAACACATGGAGGACTCAT^ 

AAACTTTATTGACAGGCTCACTCAGGGCTTTGTTGTGGATATGTTCd\CCTTGACTTT^ 
20 ATGTGGCAGATACCTATCTGACGGTTGGAGTGATTATTTTArrGATTGCAATGCTAAAAGAGGAAATAAATGGAA 
ATTAA 

4139.5 

ATGAATACAAATCTTGCAAGrrTTATCCTTGGACTGATCATCGATGAAAACGACCGTTTTTAC^^ 
25 ATGGTCAAACCTATGCTCTTGCTAAGGAAGAACGCCAACATACACTAGGGGATACGGTCAAAGGTTTTCCATAC^ 
CGGATATGAAGCAAAAACTCCGCCTGAC AACC rrAGAAGTGACTCCCACTCAGGACCAATrrGGTTGGCGACGTG 
TCACAGAGGTTCGTAAGGACTTGGGTGTCTTTGTGGATACAGGCCTTCCTGACAAGGAAATCGTTGTGTi^ 
TATTCTCCCrGAGCTCAAGGAACTCTGGCCTAAGAAGGGCGACCAACTCTACATCCGTCTT^ 
AGACCGTATCTGGGGCCTCTTGGCTTATCAAGAAGACTTCCAACCTCTTGCTCGTCCTGCCTAC^ 
30 AA CCAAAA CTGGCCACCCATTGTTTACCGTCTCAAGCTGTCaGGAA CI 1 I iOrn 'ACCTACCAGAAAATAATATGC 

TTGGTTTTATTCATCCTAGCGAGCGTTACGCACAGCCACGTTTGGGGCAAGTATTAGATGCCCGCGTTATTGGT^ 
C CGTG AAGTGCACCGCACTCTGAACCTCTCCCTCAAACCACGCTCCTTTGAAATGTTGGAAAACGATC 
ATjrTGACTTATTTGGAAAGCAATGGCGGTTTCAT^ 

CCTTTGGCATTTCTAAAGGTCAGTTCAAGAAAGCTTTAGGTGGTCTTATGAAGGCTGGTA^ 
35 GTTTGGGACAGAGTTGATTTAG 

4139.8 

atgaaagatgttagtct atrttt attgaaaaaagttttcaaaagccgcttaaact 
atctgtactcggtgttaccttttatttaaatagtcagactgcaaactcacacagcttggagag^ 
40 cgcattcc agcc aacgagagggctatcaatgaaaatgaagagaaactctcccaaatctctgataccacctcggag 
_ gaataccagtttgctaaaaataatttagacgtgcaaaaaaatcttttgacgcgaaa^ 
taaaagaagggcgctggaaagaagcctactatttgcagtggcaagatgaagagaagaattatgaatrtgtatcaa 
atgacccgactgctagcc ctgg ctt aaaa atgggggttgaccgcgaaccgaagatttaccaagccctgtatccct 
tgaacataaaagcacatactttggag tttc cgacccacgggattgatcagattgtctggattttagaggttatcat 

45 CCC>u\GTTTGriTGTGGrrGCTATTATXTTrATGCTAACACAACrATTTGCAGA>^ 
ACAGCTCACTTATATCCTGTTTCAAAACT 

TGCTGTTTATCGGAATCTGTGGLl 1 I ILl'l I I CTAGTGGG AAGTCTG ATAAGTGGTTTTGG ACAGTTAGATTATCCC 
TAC CCAA TTTATA GCTT AGTGAATCAAGAAGTAACTATTGGGAAAATACAAGATGTArTATTTCCTGGCTTGCT 
TA GCTTTCIT AGCCTTTATCCTCATTGTGGAAGTTGTGTACTrGATTGC^ 
OV CTCTTTCTT7rA£rCATTGGGATTGTTGGCTTATT 

TCTGATTCCCTTTACTTACTTGCGTTCAGTGGAGATTTTATCTGGAAGATTACCTAAGC^ 

TAAATrGGAGCATGGGAATGGTCTTACTTCCTTGCCTGATTATCTTTTTGCTATTGGGAATTCT 

TGGGGAAGTTCACAGAAAAAAGAATTTTTTAATAGATTCTAG 

55 4141.1 

ATGATGAAGTTCATATTGGATATTGTTAGTACACCAGCTATTTTAGTAGCTTTAATTGCAATCI^ 

TCAGAAGAAGAAATTACCT GATA TTATTAAAGCTGGAATTAAGACCTTTGTTGGTTTCT^ 

GCAGGAATTGTACAAAATTCTTTAAA TCCA TTTGGTACCATGTTTGAGCATGCTTTTC^ 

GAATAATGAAGCAATTGTAGCTGTAGCTTTAACAACATATGGCTCAGCTACTGCAATGATTATGTTTGCAGGCATC 
OU GTGTTCAATATCTTAAT CGCT CGTTTTACTCGATTrAAATATATTTTTTT^^ 

ATGTATGATTGCCGTCATTTTATCAGTTGCTGGCTTTACTAGCITGCCTCTCATCT^ 

GTATTATTATGAGTATTTCCCCAGCATTTCTGCAAAAATATATGGTTCAATTAACTTC 

AGGTCATITCAGrrCTTTGGGATATTCGTTGAGTGGTTTrACT^ 

GACGACATTAAATTTCC AAAG AGTTTAGC f 1 I" ri'rACCTG ATAGTACTGTTAGTATTACTTTATCCATGGCACTTAT 
OD T^ACATTATTGTAGCTATCTTTGCAGGCTCAGAATATATAGAAAAAGAAATCAGTAGTGGTACAAGTGGT^ 
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TATGCrrTACAA TTAG CAGGTCAATTTGCAGCAGCGGTAl 1101 1 ATTTTaGCAGGTGTTCGCCTTATTTTGGGCGA 
AATTGrrCCAGCCTTTAAAGGTATTTCAGAGCGTCTrGTACCTAATTCAAA^ 

ATACTTATGCACCCAATGCAGTTCTAATTGGATTTATCTCTAG r I 1 lb I i GGTGGTTTAGTAAGTATGGTAATTATG 
ATTGCTTCAGGAACGGTTG 1 1 ATCTTACCAGGTGTTGTGCCTCA 1 11 Cn CTGTGGAGCGACTGCAGGTGTCATTGG 
5 GAAT GCATCTG GTGGT GTTCCTGGAGCCACTATTGGAGCA l i i 1 I ACAAGGTATnTAATCA ari riCM CCAGTCT 

T TTTA ATGCCAGTTTTGGGAGGACTTGGTTTCCAAGGATCAACTTTCTCAGA 1 1 1 GGTCTATCaGGAATT 

ATTTTAGGA ATGT TAAATCAATTTGGCTCACAAGCAGGCATTGTGATTGGT Ci ' lG T ICi i ATTCTAGCAGTr ATGTT 
TGGAGTATCCTTTATTAAAAAGCCATCTGCAACGGACGAATAA 

10 4142.3 

ATGATTAAAACATTTCTCTCTGCCCTTTCGGTCATTCT Ll 1 ! 1 CTATCCCTATCAT AACTTATT C 1 1 Ml I CCCATCT 
TCTAATCT TAAC ATTTGGCTA TCTACCCAA CCTATCTTGGCACAGATTTATGCCTTCCCCT^ 

GGCT GCTATTTTAAGl 1 iCn A i 1 i 1 i 1 1 i CCTAT Ci i 1 1 1 ACAAGAAAAATAAACAAATACG UI ' 1 i l ACTCTGGCA 
TTTTGCTCTTACTATCGCTCATATTACTATTATTCGGAACAGATAAAACC CI l iCi i ctgcatcaaataagactaaa 
15 ACCTTAAAATTAGTAACTTGGAACCTCGCTAATCAAATAGAAGCACAACATATTGAGCGAATTm 

A CGCC GATATGGCTATA TTCC CTGAACTAGCTACCAAT ATCAG AGGTGAGCAAGAAAACCAGAGAATCAAACTAT 
TGTTTCATCAAGTTGGACTTTCTATGGCCAACrATGATATTTTCACTTCTCCACCTACCAAT^ 

GTGACTGTGATTGTCAAGAAAAGTTATGGTTTCTATACAGAAGCTAAAA CJ i ] ' 1 CATACAACACGGTTCGGGACAA 
TTGTATTACATTCGAGAAAACAAAATATACCAGATATCArrGCCTTGCATACTGCGCCTCCTCTGCCAGCTTTAAT 
20 GG AAAT CTGGAAGCAAGACTTAAACATCATTCATAATCAATTGGCrrCAAAATATCCAAAGGCT 

G ATTTT AATCCAACTATGCGTCATCCAGCACTrGCAAAAATAAGCTCTCATAGGGACGCATTAAATGCACTG 
CCTTTTGAAAGAGGAACTTGGAATAGCCAAAGTCCAAAACTTTTTAATGCAAC 

AAA ACCAC TACT ATGTT AAAGATTTAGACATTGTAAGTTTTCAAAACTCTGATCATAGATGTAT^^ 
CACATTTTAA 

25 

4142.4 

atgaatccaatccaaagatcttgggcnt^vtctcagcagaaagcgactgagaag 

tggtcrrarrggccggaatp*cagcctgtttgactctgatgaagtccaacaaaac^ 

atcacrcaatacatctttttctattaagaagatagagaatggtcagacarrc^agttgtcagaccta 

30 agcaagattaaggggctggaaaatgtctctcctgaacttgagacggtcgcaaaactaaaagacaaggaagca^ 
actggcgagcagagcgtggagratgatgatttatcagctgcagacaataacttgg 
gattcatccaaggatgtaacctttaccagttcggctttcaatctaaaagaagggcgacacc^ 
tccaagaaaatccttatccacgaagaattggctaagaagaacggtcrtrcgcttcatgac^ 
ctggtcagtctgaatctggaaaaggagaaacagtagagrttgagattatcggcatcttrrctggtaaaaaac^ 

35 agaaattcacaggcttgtcttctgacttcactgaaaatcaactctrtacagactatga>^ 

gggcaatagtgaagctcaagtcagtgcagcacgcttctatgtagaaaatcctaaggaaatggacggactcatgaa 
gcacgtagaaaacttggccttggaaaatcaaggctaccaagtcgaaaaggaaaacaagc^^ 
agactcagttpcaactrrccaaaccttcctgaccatcttcctttatgggatgt^ 
* attctggttttgtctctctggttg ag ag aacgggtctatg aagtggggattttacttgcacttggaaaaggcaaga 

40 gctcgatcttcctacaattctgtttagaggtagttttggtatctcttggagctrtgc^ 

aacgcaatcac aac ttacctactccaaactctactagcaagtggagatcaggcaagcntacaagatao\ctagc 
aaagca agcag rrtatcaactagcatcttatcttttgcagaatcctatg 1 i ' itr ctagttctgcttagttgcttatc 
tgtaggcctttgtttcctattcttatttagaaaatcaccgaaagaaattttatcatctattagtt^ 

45 4142.5 

atgttacacaacgcatttgcctatgttacaaggaagtttttcaaatcgattgtcatcttcctgattattct^ 
ggcgagcttgagtttggtcggcrrgtcaatcaagggagctactgccaaggcttctcaggagaccritaaaaata^^ 
acc>atagcttctccatgcaaatcaatcgtcgcgtcaaccaaggaacgccrcctggtgctgggaatatcaagggt 
gaagacatcaaaj-laaatcaccgaaaacaaggccattgagtcrratgtcaaacctatcaacgctatcggagarr^ 
du actggatatgacctgattgaaacgccagaaaccaagaagaatctcactgctgatcgtgccaagcgttttggaagt 
agcttgatgattacaggtgtcaatgactcctctaaagaagacaagtttgtctctggttcttataaactagtcgaag 
gagagcacttaaccaacgacgacaaggataaaatcctcttgcacaaggacttggcagccaaacacggctggaaa 
gtaggggacaaggttaaa ctgg actctaatatctacgatgcagataatgaaaaaggagccaaggaaacagttga 
agtgacaatcaagggactctttgatggtcat aata agtcagcagtaacctactcacaagaacrrracgaaaacac 
ag ctatt acagacattcacactgctgcaaaactttatggatacacagaagacacagccatttatggggacgcaac 
cttctttgtaacagcagacaagaacttggatgatgttatgaaagagttgaatggcatcactggtatcaactgoaa 

GAGCTACACACTCGTCAAGAGCTCCTCTAACTACCCAGCTCTTGAGCAATCTATCTCTGGTATGTACAAGATGGCC 
AACCTCCTCnrCrGGGGTAGCTTGAGCTTCTCACTTCTCCTCCTTGCCCTCTTGCTC^ 
TCGCAAGGAAGTGGGAATTCTCCTCTCTATCGGCCTCAAGCAGGCAAGTATCTTGGGTCA 
tU ATrCATCACCCAATCTATCrrGATTGCTATCCCTGCTCTAGrrrCTGCrTACTrCCTAGCTA^ 

caatrggaaacactgtccttgccaatgt gacr rcaggtgttgccaaacaggctagtaaggcggctca^ 
acct tcgtgg tggtgc agaactagatggctttagcaagaccttgtcgagcctagacatttccattcagacatcaga 

CnTTATCATCATTTTrGTCCrrGCCTTGGTTCTAGTGGrrCTCGTTATGGCGCrrGCTTCAA^ 
AACAACCAAA AGAGCrCTTGCTGG ATGCTG AATAA 
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4144.1 

ATCTCACAGGATAAACA.\ATGAAAGCTCTTTCTCCCCTTCTGCAGCGAGTTATCAATATCTCATC^ 
GGGTTGGGAGTTTGATTTrCTCTATTTGGGCTTATCAGGCTGGGATTTTACAATCCAAGGAAAC^ 
ATCCAGCAGCCAGGCATCTCGGCrCCACCTCTCTTTATCri I I 1 ACAGATTTTACAGACTGTCGTCCCTATCATTCC 
5 AGGGGCCTTGACCTCGGTGGCTGGGCTCTTTATCTACGGGCACATCATCGGGACTATCTACAACTATATCGGCATC 
GTGATTGGCTGTGCCATTAT UI 1 Vi ATCTAGTGCGCCTATACGGAGCTCCCrrTGTCCAGTCTGTCGTCAGCAAGC 
GCACCTACGACAAGTACATCGACTGGCTAGATAAGGGCAATC OI i 1 I GACCG Ll iLIil A ni 1 i ATGATGATTTG 
GCCCATTAGCCCAGCTGACnTTCTCTGTATGCTGGCTGCCCTGACCAAGATGAGCTTCAAGCGCTACATGACCATC 
ATCATTCTGACCAAACCCTTTACCCTCGTGGTTTATACCTACGGTCTGACCTATATTATTGACl 1 1 1 ICTGGCAAAT 
10 GCTTTGA 

4144.2 _ 

atgagaaatatgtgggttgtaatcaaggaaacctatcttcgacatctcgagtcatggaultrci icl i f atggtga 
tttcgccgttcctcrrntaggaatctrcrgtaggaattgggcatctccaacgttc^ 
15 gtggcagtagtgacaacagtgccatctgtagcagaaggactgaagaatgtaaatggtcttaacttcgactataaa 
gacgaj\gc\agtgccaaagaagcaattaaagaagaaaaattaaaaggttat^ 
gttctaaaggcagtttatcatggcgaaacatcgcttgaaaatggaattaj\attr^ 
aacrgcaaaatcagcttaatcgttcaactgcttcctrgtctcaagagcaggaaaaacgcttagc^ 

AATTCACAGAAAAGATTGATGAAGCCAAGGAj\AATAAAAAGTTTATTCAj^ACAATTGCAGCACG^ 
20 T Ll 1 ICl 1 I ATAT GATTC TGATTACCTATGCGGGTGTAACAGCTCAGGAAGTTGCCAGTGAAAAAGGCACCAAAAT 

TATGG AAGTCO 1 I'i 11 ICTAGCATAAGGGCAAGTCACTATTTCTATGCGCGGATGATGGCTCTGTTTCTACTGATTT 
TAACGCATATTGGGATCTATGTTGTAGGTGCTCTGGCTGCCGTTTTGCTCTITAAAGATTTGCCATTC^ 
TCTGGTATTTTGGATCACrrGGGAGATGCTATCTCACTGAATACCTTGCTCrrTATrrTGA i I ICA TGTA 

cgtagtcttggc^cccttcctaggatctatggtttctcctcctgaggactcagggaaagccttct 
25 attttgattatgggtgg i l l 1 1 i i ggagtgacagctctaggtgcagctggtgacaatctcctcttgaagattggttc 

TTATATtCCCTTTATTTCGACC TTCrrr ATGCCGTTTCGAACGATTAATGACTATGCG^ 

ATTTCACTTGCTATTACAGTGAI I i \ IGCGG TGGT AGCAACAGGATTTATCGGACGCATGTATGCTAGTCTCGTTCT 
TCAAACGGATGATTTAGGGATTTGGAAAACCTTTAAACGTGCCTTATCTTATAAATAG 

30 4144.3 

ATGACAGAAACCATTAAATTGATGAAGGCTCATACTTCAGTGCGCAGGTTTAAAGAGCAAGAAATTCCCC^ 
GACTTAAATGAGATTTTGACAGCAGCCCAGATGGCATCATCTTGGAAGAATrrCCAATCCTACTCT 
TACGAAGTCAAGAGAAGAAAGATGCCTTGTATGAATTGGTACCTCAAGAAGCCATTCGCCAGTCTGCTG I'l 11 CCT 
TCTCnTTGTCGGAGATTTGAACCGAGCAGAAAAGGGACCCCGACTTCATACCGACACCTTCCAACCCCAAGGT 
35 GGAAGGTCTCTTGATTAGrrCGGTCGATGCAGCTCTTGCrGGACAAAACGC Li 1 U 1 i GG GAG CTGAAAGCTTGGGC 

tatggtggtgtgattatcggtttggttcgatacaagtctgaagaagtggcagagctctttaaccracctgactaca 
cctattc tgtc tttgggatggcactgggtgtgccaaatcaacatcatgatatgaaaccgagactgccactagaga 
atgttgtctttgaggaagaataccaagaacagtcaactgaggcaatccaagcttatgaccgtgttcaggctgact 
atgctggggcgcgtgcgaccacaagctggagtcagcgcctagcagaacagtttggtcaagctgaaccaagctc^ 
40 ctagaaaaaatcttgaacagaagaaattattgtag 

4146.1 

ATGTTAAAj\CrTATTGCTATTGTTGGAACAAATTCAAAACGTTCTACAAACCGTCAATT 
Aj\CACTTTACTGACAAAGCTGAAATrGAACTTGTTGAAATCAAGGCCATTCCTGTCTrCAAC^ 
45 GCAAGTACCTGCTGAAATATTGGAAATTGCTGCTAAAATCGAAGAGGCAGATGCCGTTATTATCGGTACTCCTGA 
GTATGATCACTCTATTCCAGCTGTTrrGATGAGCGCTCTTGCTTGGTTGTCTTATGGTATT^ 

Aj\CCAj\TCATGATTACAGGTGCTTCTrACGGTACGCTTGGTTCATCTCGTGCCCAArrGCAACTTCGTCAj\AT^^ 
GAATGCTCCTGAAATCAAGGCAAAT Ol ILi 1 CCAGATGAATTCTTGCTCTCACACTCTCTTCAAG CATT TAACCCA 
AGTGGCGACTTGGTTGACCTTGATGTTATCAAGAAATTGGATGCCATCTTTGATGACTTCCGTATCTT^ 
50 TCACAGAAAAATTACGTAATGCACAAGAATTACTTCGCAAAGATGCTGAAGACTTTGACTGCGAAAAT^ 

4146.2 

ATGAATACCTATCAATTAAATAATGGAGTAGAAATTCCAGTATTGGGATTTGGAA Cri i i AAGGCTAAGGATGGA 
GAAGAAGCCTATCGTGCAGTGTTAGAAGCCTTGAAGGCTCGTTATCGTCATATTGATACGGCGGCGArrTA TCAGA 

55 ATGAj^GAAAGTGTTGGTCAAGCAATCAAAGATAGCCGAGTTCCACGTGAAGAAATGTTCGTAACTACCAAGCrrT 
GGAATAGTCAGCAAACCTATGAGCAAACTCGTCAAGCTTTGGAAAAATCTATAGAAAAACTGGGCTTCGATT^ 
TGGATTTGTATTTGATTCATTGGCCGAACCCAAAACCGCrCAGAGAAAATGACGCATGGAAAACTCGCAATG^^ 
AAGTTTGGAGAGCGATGGAAGACCTCTATCAAGAAGGGAAAATCCGTGCTATCGGCGTTAGCAAI J I ICl ICCCC 
ATCATTTGGATGCCrrGCTTGAAACTGCAACTATCGTTCCTGCGGTCAATCAAGTTCGCTT 

60 TCAAGATCAAGTCGTAGCTTACTGTCGTGAAAAGGGAATTTTATTGGAAGCTTGGGGGC Cl i 1 1 GGACAAGGAGA 

ACTGTTTGATAGCAAG CAAGT CCAAGAAATAGCAGCAAATCACGGAAAATCGGTTGCTCAGATAGCCTrGG CCTG 
GAGCTTGGCAGAAGGATTTTTACCACTTCCAAAATCTGTCACAACCTCrCGTATTCAAGCTAATCTTGAT^ 
GGAj^TTGAACTGAGTCATGAGGAGAGAGAAACCTTAAAAACGATTGCTGTTCAATCGGGTGCTCCACGAGTTGAT 
GATGTGGATTTCTAG 
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4147.1 

ATGAGGTGCAAAATGCTTGATCCAATTCCTATTCAACTACGACCCCTAGCCATTCGTTGGTATGCC^ 
TGACAGGCTTGArrCTTGCGGrrrATTTGACCATGAAAGAAGCACCTAGAAAGAAGATCATACCAGACGATAT^ 
AGATTTrATCTTAGTAGCCTrrCCCTTGGCTATTTTAGGAGCTCCTCT 
5 TAGTCAGAATTTAGGAGAGA l 1 I 1 I GCCATTTGGAATGGTGGTTTGGCCATTTACGGTGGTTTGATAACTGGGGCT 

CTTGTGCTCTATATCTTTGCTGACCCTAAACTCATCAATACTTGGGATTTr 

GArrGCTCAAAGTTTGGGCCGTTGGGGTAAl i 1 L U I AACCAAGAAGCTTATCGTGCAACAGTGGATAATCTGGAT 
TATCTACCTGGCTTTATCCGTGACCAGATGTATATTCAGGGGAGCTACCGTCAACCGACTTTCCnT^ 
TATGGAATCTGCTTGGCTTTGCCTTCATTCTGAI l i 1 1 AGACCGAAATGGAAGAGTCTCAGACGAGGTCATATCAC 
10 GGC LI i 1 1 ACTTGATrrGGTATGGTTTCGGTCGTATCGTTATCGAAGGTATGCGAACAGATAGTCTCAT Cjl iCU CG 

GCTTTCGAGTGTCCCAATGGCTGTCAGTTGTCCrrATCGGTCTCGCTATAATGATCGTTAT^ 
GAAGGCCCCTTACTATATTACAGAGGAGGAAAACTAA 

4147.2 

15 ATGGGTAAATTATCCTCAATCCTTTTAGGAACCGTTrCAGGTGCAGCTCTTGCC^ 

GCAAACAAGTTTGCAGTCAGGCTCAAGATTTTCTAGATGATTTGAGAGAAGATCCGGAGTATGCCAAGGACCAAG 
TCTGTGAAAAACTGACAGAAGTTAAGGAGCAGGCTACAGAl l I I GTTCTGAAAACAAAAGAACAGGTTGAGTCAG 
GTGAAATCACTGTGGACAGTATACTTGCTCAAACTAAATCCTATGC I ITl CAAGCGACAGAAGCATCAAAAAATC 
AATTAAATAATCTCAAGGAGCAATGCCAACAAAAAGCCGAAGCTCTTCATCACrc:^^ 

20 T AACAG AAG AAT AA 

4147.3 

ATGAAAACTAAATTGATCrrTTGGGGCTCTATGCTCTTTCTCCTCTCCCTCT 
TTGGATTTTCTATCCTATGGAGATTCAGTGGCTAAACnTAj\CGAATCGAGTCTATCTAAAACC^ 

25 TACAATTTTCaTATCTTGATGAATTATCTGACCAATCLI I 1 1 AGTCAGGTCTTACAGATGCCTGATTTTCCTTCGtC 

AGCAGCTGGTCTG CACCA TTTCCCAGTGGTCAAGAATCTCTTTCATTTGGTTCAGCTAGTAGCT 
CCAAGTTTCTATCTCTTTGTCAATAGGATTGTGAA AAAG GALI il l 1 GTCTCTTTATCGAAAAAGTCTCCTGGCTCT 
AGTA GTCT TACCTGTGATGATTGGACTTGGGGGACTTTTGATTG G T 1 i 1 GACCAATTCTTTACT Ci 1 i i CCATCAAA 
TrCTCTTTGTGGGAGATGATACCTGGCTTTTTGATCCAGCCAAGGATCCTGTTATTATGAT^ 

30 rriCnCATGCCTTCCTCCTCl 1 i'll'IGCCCTCTATGAAAACriCri'lGGCTATCTGTATCTCAAAACTCGTAGGAA 

GTGA 

4149.1 

ATGACTTATCATTTTACTGAAGAATACGATATTATTGTAATTGGTGCGGGACACGCTGGGGTTGAGGCT^ 
35 CCGCTAGCCGTATGGGCTGTAAGGTCCTGCTTGCGACCATCAATATTGAAATGCTGGCTTTCATGCCrrGTA^ 

CTCTATCGGTGGTTCTGCCAAGGGGATTGTCGTGCGTGAAGTCGATGCCCTCGGTGGCGAGATGGCCAAAACCATT 
GACAAGACTTACATCCAGATGAAGATGCTAAACACAGGGAAGGGGCCAGCrGTCCGTGCCCTTCGTGCGCAGGCT 
GACAAGGAACTTTACTCTAAGGAGATGCGCAAGACGGTTGAAAACCAAGAAAATCTGACCCrrCGTC/^^ 
ATTGATGAGATTTTGGTGGAAGATGGCAAGGTTGTCGGTGTGCGTACAGCCACCCATCAAGAATATGCTGCTAAG 
40 GCTGTTATTGTGACGACAGGGACTGCTCTCCGTGGGGAAATTATCATCGGAGACCTCAAGTACTCATCAGGTCCTA 
ACCACAGCTTGGCTTCTATTAACCTAGCTGACAATCTCAAGGAACTGGGTCTCGAAATCCGTCGTT^ 
AACCCCTCCACGTGTCAAGGCrrCTTCTATCAArrACGATGTGACAGAAATTCAGCCAGGAGACGAAGTGCCTAAT 
CATTTCTCATACACTTCACGTGATGAGGATTATGTCAAGGACCAAGTACCATGCTGGTTGACCTATACCAATGGTA 
CCAGTCATGAGATTATCCAAAACAACCTCCACCGTCCGCCTATGTTTACAGGTGTGGTCAAGGGAGTGGCGCCTC 
45 GTTACTGTCCGTCGATTGAAGACAAGATTGTGCGCTTTGCGGACAAGGAACGTCACCAACTCTTCCT^ 

AGGGCGCAATACTGAGGAAGTCTATGTGCAAGGACTTTCAACCAGTCTGCCTGAGGATGTCCAGCGTGACnT^ 
GCATTCCATCAAAGGTTTGGAAAATGCAGAGATGATGCGGACAGGTTATGCTATTGA 
GTATGATATGGTCTTGCCTCATCAGTTGCGTGCGACTTTGGAAACCAAGAAAATCTCAGGTCTC^ 
CAGACAAATGGAACATCAGGTTACGAAGAGGCAGCAGGCCAAGGGATTATCGCGGGTATCAATGCGGCTCTGAA 

50 aatccaaggcaagcctgaattgattttgaagcgcagtgatggttatatcggggtgatgatajacgacttggtg 
caagggaaccattgaaccctaccgtctcttgaccagtcgtgctgaataccgtctcattcttcgtcatgacaatc 
gatatgcgcttgactgagatgggacgcgagattggccttgtggacgatgaacgctgggctcgttttgaaatca^ 
aaaaatcaatttgataatgagatgaagcgcctagacagtatcaaactcaagccagtcaaggaaaccaatgccaag 
gttgacgagatgggcttcaaacccttgaccgatgcagtgacagccaaggaattccttcgccgtccagaa u tl i ci 1 

55 ACCAAGATGTGGTGGCCrrCATCGGACCAGCTGCAGAAGACTTGGATGACAAGATTATCGAATTGATTGAAACAG 
AAATCAAGTATGAAGGCTATATTTCCAAAGCCATGGACCAGGTTGCCAAGATGAAACGCATGGAAGAAAAACGCj^ 

ttccggccaatatcgactgggatgacattgattctatcgcaaccgaagcccgtcagaagttcaaactcatcaatcc 
agaaaccatcgcccaagccagccgtatttccggagtaaacccagcagatatttctattttgatggtgtatctggaa 
ggtaaaaatcgtagtatttctaaaactcttcaaaaatcaaaatga 

60 



4149.2 

ATGAAAGTATTAGCl 1 1 1 GATaCGTCCAGCAAGGCTCTTTCTCTGGCTATTTTAGAGGATAACCAG G 11 Cl ' l GCCG 
AGACGACGATTAATATTAAGAAAAATCACAGTATTACrCTTATGCCTGCCATCGATTTTTTGATGGCAAG^ 
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TTGCAC>CCCAAGGATTTGGACCGAATCCTGGTAGCTGAAGGGCCGGGTAGCrATACAGGCTTCCGAAT^^ 
AGCAACTGCTAAG ACCT TAGCTCACACCCTGAACATCCAGTTGGTTGGTATCTCGAGTCTOT 
CATCAACAAGAAGO i 11 U 11 iUlCCCCTrGA TGGA TGCGCGTCGCAATAATGTTTATGCAGGATTTTATGAAAATG 
CCAAACCTGTC\TGGCAGAAGCGCACCTATCrrTT 
5 CCTTrGTCCGAGAAGTTGGCCC CTTTG TTGAGCAGATTCAAAAACACTTGCCAA 

GCCCAATGCAGCTAATCTAGCTCTTTTGGCCTGGGACAAGGAj\GCAGACTCCrrGCATGAT^ 

CTCAAACGAGTCGAGGCTGAGGAAAACTGGCTCAAGAACCATACCGAGTCrGGCGAGTCrrACATTAAACGCCTA 

TGA 

10 4149.3 

ATGATAGAAATCAAGCGAATTCAACAACAGCCTGACCTAGCTCAAGCCATCTACGCTCTTATGGCAGCTCTT^ 
TAGTCAGTCCTTGGACTCTGGAGCAAATCCAAGCAGATCTGTCCCAAGACCACACTTGCTATGCATTGGCTTATGA 
TGGGGCAGAAGTGATTGGATTTCTAGCTGTGCAGGAGAATCn 1 1 1 GAAGCAGAAGTCCTGCAAATCGCTGTCAA 
AGGAGCTTATCAGGGTCAGGGGATTGCGT OgCCn TGTTTGCTCAATTGCCGACAGACAAGGAAATTTTCCT 
15 GTCAGACAGTCAAATCAACGAGCGCAAGCATTTTACAAGAAAGA.AAAGATGACAGTTATCGCTGAGCGAAAGGC 
CTACTACCATGACCCAGTCGAGGACGCCATTATCATGAAGAGAGAAATAGATGAAGGATAG 

4152.2 

ATGACAAAACAAGTCTTATTAGTGGATGATGAAGAACACATTCTGAAATTCCTTGACTA^^ 
20 GGCITTTCTACrCAATTCGTGACAAATGGACGGAAGGCCTTAGCTTTGGCAGAAACAGAACC 

TGCTTGATATC ATGTT ACCACAATTAGATGGCATGGAAGTTTGTAAGCGGCTGAGAGCCAAAGGCGTCAAAACTC 
CAATTATG ATGo i i i CTGCGAAAAGTGATGAATTTGATAAGGTTTTGGCCTTGGAATTAGGGGCTGATGACTACCT 

gaccaagccttttagccctagagaatrgctggcgcgtgtcaaggctgtcctcaggcgaactaaaggagaacaaga 
aggacatgattcagataatatcgctgacgattcttggctatttgggaccttgaaagtataccctgagcgtcatgaa 

25 GTCTACAAGGCGAATAAGTTACTGAGTTTGACCCCAAAAGAATTTGAAAGCGATAAAAATCC LiI mi IG AAGTTT 

TCAAAGTrtCGAAAGTAACCGCCCAATAA 

4154.1 

ATGACTACTTTTAAAGATGGATTTTTATGGGGTGGTGCTGTTGCTGCTCATCAACn^ 
30 GTGCCAAGGGAATTAGTGTTGCTGATCTTATGACTGCTGGTCGTCATGGAGTAGCTCGTGAAATTACTTTGGGACT 
TTTAGAGGGTAAATATTATCCAAATCATGAGGCGATAGATTTTTATCACCGTTATAAAGAAGATATAGCAC^^ 
GCTGAAATGGGATTCAAGT GCTTC CGTACCTCTATTGCATGGACACGTATCTTTCCAAAAGGTGATGAGTTAGAGC 
CGAATGAAGAA^GATTACAGTTTTATGATAATCTTTTTGATC 

TCTATCT CATTT TGAAATGCCTTATCACTTAGTGACCGAATATGGTGGTTGGAAAAATAGGAAAT^ 

35 TTTGCTCGTTTTGCAGAAGTCGTATTTAAACGTTACAAAGATA^ 

ATAATCAAGCGAATTATCAGGAAGATTTTGCACCATTTACTAACTCAGGTATTGTATATGAGGAAGGTGATAATAG 
AGAAGCA ATTAT GTATCAAGCAGCACATTACGAATTAGTTGCTTCTGCACGAGCTGTAAAAATTGGTCATGAGATT 
AATCCAGATTTTCAAATAGGTTGTATGATTGCGATGTGTCCAATTTATCCAGTTACTTGCAATCCr 
- AATGGCAATG AAAGCTATGCAGAAGCGTTATTATTTTGCTG ATGTGCATGTTTTAGGTAAATATCC^ 

40 TTCAAGTATTGC GAAC GAAAAGGTATTTCAGTTGATTTTACTGCCCAGGATAAAGAAGATTTAC^ 
TAGATTACATTGGTTTCAGTTACTATATGTCCTTTGCTATCGACTCTCATCGTGAAAATAATCC^ 
CTTGAAACAGAAGATTTAGTGAAAAATAATTATGTTAAGGCTTCTG>^ - 

ttgcgttatgcgttaaattggtttacagaccactatcacttaccactctrrattcttga^ 
agatcaagttgcagcagatggtatggtacatgatgattatagaattgaatatctaggtgcccatattcgtgaaatg 
45 aaaaacgctgtagttgaagatggtgttgatttaatgggttatactccatggggatct 

gtaccggtgaaatgc ggaaa cgttatggctttatttatgtagataaagatgataatgggaagggaagttataatc 
gttccccgaaaaaatcttttgcctggtataaggaagttattrcatctaacggtgaatcagtagaatag 

4154.2 

50 ATGGATCAACAAAACGGO 1 lUl I'lGGl 1 1 ICriGAAAACCATGTTATGGGACCAATGGGCAAACTTGCTCAGTTTA 

AAGTA GTAC GTGCTATC ACGG CTG CAGGT ATGGCTGCTGTACCATTTACTATTGTAGGATCAATGTTTTTG 
CA GTAr rTTGCCACAAGCTTTCTCATTTTGGCCAATTGTGGCAGATATTTTCTCTGCT^ 
CACTTTACATGGTrGCAAACTATGCGACTATGGGTTCTCTATCTCTTTATTTCGTTCTAT^ 

ACAAAAj\rTTATGCAGAGGAAGAAGAACTCAATATGAATCCTCTTAATGGTGCCTTGCTTGCCTTGATGGCrm 
55 TCATGACAGTACCGCAAATCATTTTTGATGGTGGAATGATGAAGACTGTGACAAGTCTAAAAGAAGGTGCAGTA^ 
TTGCAGATGGATGGGCAATGGGAAATGTAGTCGCACGTTTTGGGACAACAGGGATTTTTACCGCAATC^ 
CAATTOTGACTGTTCrrATTTATCCTATGTGTGT^ 
GGAGTTTCTCGTGGATTTACCGCTjrGGTTCCGGGAT^ 
/:a GTA GCAA TGGGAACAGATATTTTTAAAGTCATTG CAAT TCCATTTGGTTTTGTATCCAATC^ 
OU TrGGTTTAATGATTATTrATCTATTGACTCAACTACTTTGGATTGTAGGTATCCACGGTGCGAACATTGT^^ 
TTTGTTAGTCCAATTGCTCTTGCTAACATGGCTGAAAATGCTGCTGGCGGGCACT^ 
CTAATATCTTTGTAATTGCAGGTGGTrCTGGTGCAACTTTAGGACTATCTTTATATAT^^ 
GAACAGCTTAAAGCAATAGGACGAGCATCTGTAGTTCCAGCCTTATTTAATArrAATGAACCATTAAT^^ 
TACCTATTATCTATAj^CAGCCTTGGCTATACCATrTATTTTACCACCAATGGTTACTG 
05 GCGAATTCTCTAAACTTTATTAAGCCAATTATCGCACAGGTTCCATGGCCAACTCCAGTAGGGATTGGAGC^ 
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TAGGCACAGCAGATCrrCGAGCrGTArrAGrrGCTCTACrrATCTGCATTTGCAGCATTCCrAGTCTA^ 
ATCCGTGTATATGATCAAAAATrGCTGAAAGAAGAGCAAGCTATCTAA 

4155.1 

5 ATGAAAAAATTTTATGTAAGTCCA AZ M 1 i CCTAT TCrA GTAGGATTGATTGCGTTTGGAG TCTTATC CACTTTCAT 

TA Tl i 1 iUi TAATAATAATCTGTTGACGGTTTTAA n I ' l G T ' l ILl U 1 I GTAGGAGGCTATGl i 1 i 1 i iAT TTAAG AA 
ACTGAGAGTGCATTATACAAGGAGTGATGTAGAACAGATACAGTATGTAAACCACCAAGCGGAAGAAACTTTGAC 
AGCTCTATTGGAACAGATGCCTGTAGGTGTTATGA AATT GAATTTATCTTCTGGAGAGGTTGAGTGGT^ 
TATGCTGAATTGATTTTGACCAAGGAAGATGGTGATTTTGATTTAGAAGCTGTTCAAACGATTAT^ 

10 TAGGAAATCCGTCTACTTATGCCAAGCTTGGTGAGAAGCGTTATGCTGrrCATATGGATGCTTCTT 

GTATTTTGTAGATGTATCCAGGGAACAAGCCATAAC>.GATGAATTGGTAACAAGTAGACCAGTG ATTC 
CTCTGTGGATAATTATGATGATTTGGAGGATGAAj\CTn[CTCAGTCAGATATTA 
AAriTTATATCAGAGTTTTCAGAAAAAO\CATGATGTTTrCTCGTCG 
TGACTA(>CGGTGCTTGAGGGCTTGATGAATGATAAATTTTCTGTTATTGATGCTTr^^ 

15 AGACAGTTGCCCTTGACCTTAAGTATGGGA n 1 iCl I ATGGCGATGGAAATCATGATGAGATAGGGAAAGTTGCTT 

TGCTCAATTTGAACTTGGCTGAACTACGTCGTGGCGACCAGCTGGTTGTTAAGGAAAACGACGAAACGAAAAA 
CAGTTTATTrrGGTGGTGGGTCTGCTGCTTCAATCAAGCGTACACGGACTCGTACGCGCGCTATGA^ 
TTCAGATAAGATTCGGAGTGTAGATCAGGl 1 1 I IGTAGTCGGTCACAAAAATTTAGACATGGATGCTTTGGGCTCT 
GCTGTAGGTATGCAGTTGTTCGCCAGCAATGTGATTGAAAATAGCTATGCrCTTT 

20 CAGATATTGAACGAGCTGTTTCATTCATAGAAAAAGAAGCAGTTACGAAGTTGTTGTCTGTTAAGGATGC^ 

GGATGGTGACCAATC O 1 iCl I I GTTGATTCTTGTAGACCATTCAAAGACAGCCTTAACATTATCAAAAGAATTTTA 
TGATTTATTTACCCAAACCATTGTTATTGACCACCATAGAAGGGATCAGGATTTTCCAGATAATGCGCTTATTA 
TATATCGAAAGTGGTGCAAGTAGTGCCAGTGAGTTGGTAACGGAATTGATrCAGTTCCAGAATTCTAAGAAAAAT 
CGTrrGAGTCGTATGCAAGCAAGTGTCTTGATGGCrGGTATGATGTTGGATACTAAAAATTTCACCTC 

25 CTAGTCGGACATTTGATGTTGCTAGCTATCTCAGAACG CGCG GAAGTGATAGTATTGCTATCCAGGAAATCGCT^ 
GACAGAtTTTGAACAATATCGTGAGGTCAATGAACTTATTTTACAGGGGCGTAAATTAGGTTCAGATCT 
GCAGAGGCTAAGGACATGAAATGCTATGATACAGTTGTTATTAGTAAGGCAGCAGATGCCATGTTAGCCATGTCA 
GGTATTGAAGCGAOl 1 1 IG I 1 CI'IGCGAAGAATACACAAGGATTTATCTCTATCTCAGCTCGAAGTCGTAGTAAAC 
TGAATGTACAACGGATTATGGAAGAGTTAGGCGGTGGAGGCCACnTTAATTTGGCAGCAGCTCAAATTAAAGATC 

30 TAACCTTGTCj\GAAGCAGGTGAAAAACTGACAGAAATTGTATTAAATGAAATGAAGGAA^ 
TGA 

4156.1 

ATGAAAGAGAAAAATATGTGGAAAGAATTGTTGAATCGTGCAGGCTGGAIM'IGGTCI 1 li 1 ACTTGCCGTCCTTT 
35 TATATCAGGTTCCCCTAGTGGTTACCTCTATTTTGACTTTAAAAGAAGTAGCCCTGCTACAGTCA GGGCT G 
TGCTGGCCrrrCAATTGTGGTTCTGGCTCrATTTATTATGGGAGC^ 
CTTTTTTTAGAGCTAAAGATTTGGCACGTTTGGGCTTGAGTTATCTAGTTATT 

ATTTTATTGCAACTGTCAAj^TGAGACGACAACAGCTAACCAGTCTCAGATTAATGATATGGTTCAAj^ 
TGATTTCCAGTTTCTTCTTGCTAGCCTTGCnTGCTCCGATrTGTGAGGAAATC^ 

40 AAGATTTrCCGAGGCAAGGAGAACTTGGGATTTGTAGTCGGTACGATTGTCTTTGCTTTATTGCATC 

ATTTAC Li ILi 1 l ATTGATTTATGGAGGTATGTCGACAGTTCTATCTTGGACAGCCTACAAGACCCAACGTTTGGA 
AATGTCGATCTTGCTTCACATGATTGTTAATGGGATTGCTTTCTGTTTGTTGGCTCTTGTGGTGA^ 
CATTAGGAA Tfl CI'GTTTAAATGAAAGACAAAAATATGTGGAAAGAATTGTTGAATCGTGCAGGCTGGA riTlGGT 
LI 1 H l ACTTGCCGTC Cl i i I ATATCAGGTTCCCCTAGTGGTTACCTCTATTTTGACTTTAAAAGAAGTAGCCCTGC 

45 TACAGTCAGGGCTGATAGTTGCTGGCCnTTCAATTGTGGTTCTGGCTCTATTTATTATGGGAGCTCGTAAA^ 

GTTAGCTAGTTTTAA I 1 i llLl 1 i 11 1 i agagctaaagatttggcacgtttgggcttgagttatctagttattgtcg 

GGTCAAATATACTTGGTTCCATTTTATTGCAACTGTCAAATGAGACGACAACAGCTAACCAGTCTCAGAT^ 
TATGGTTCAAAATAGTTCGTTGATTTCCA Gri iCl ILi 1 GCTAGCCTTGCTTGCTCCGATTTGTGAGGAAA TCrrG T 
GTCGTGGGATTGTTCCTAAAAAGATTTTCCGAGGCAAGGAGAACTTGGGATTTGTAGTCGGTACGATT 
50 TTTA1TGCATCAACCAAGTAATTTACC1 [CM 1 ATTGATTTATGGAGGTATGTCGACAGTTCTATCTTGGACAGCCr 

ACAAGACCCAACGTTTGGAAATGTCGATCTTGCTTCACATGATTGTTAATGGGATTGCTTTCTGT^ 
GTGGTGATTATGAGTCGGACATTAGGAATTrCTGTTTAA 

4156.4 

55 ATGGATACACAAAJ^GATTGAAGCGGCTGTAAAAATGATTATCGAGGCTGTAGGAGAGGACGCTAATCGCGAGGGC 
TTGCAGGAAACACCTGCTCGTGTAGCCCGTATCTATCAAGAGA l 1111 1 CAGGTCTTGGTCAAACAGCAGAGGAAC 
ATTTGTCAAAATCCTTTGAAATTATTGACGATAATATGGTGGTAGAAAAGGATAT L ' rr I ' l i CCATACCATGTCTGA 
ACACCACTTCTTGCCATTTTATGGTAGAGCGCACATTGCCTACATTCCAGATGGTCGTGTGGCAGGCTTGTCTAAG 
CTAGCCCGTACGGTTG AAG 1" ! " I ATTCG AAAAAACC ACAAATTCAAG AACGTTTG AATATCG AAGTGGCCGATGCC 

60 TTGATGGACTATCTAGGTGCTAAAGGAGCCTTTGTTGTCATTGaGGCGGAACATATGTGTATGAGTATGCGTGGTG 
TTAGAAAACCAGGCACTGCAACCTTGACGACAGTAGCTCGTGGTCTATTTGAAACAGATAAGGATCTCCCTGACC 
AAG CTT ATCGTTT AATGGGG CT AT AA 



4157.2 
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ATGAAAGACTTGTTTTrAAAGAGA.AJVGCAGCCCTTTCGTAAGGACrrGTCTTGC^ 
ACCACTTrGTCrrGTTCCTGCTTGTCCTCTTGGGCTTTCTAGCCTACCACT 

GAAAATCATTGGCCTATC Ll 1 i tOl i 1 GTAGGAATTACGTCT O 1 1 i l A Ll iTr ACnTGGGGAGGAACTGCCACCTA 
TATCGAGGCTCCAGACAAGCTCTTTCTCTTACT TCGAG AAGAGGAAATTAAGCTCCAT CTCA^ 
5 CATTTCCCTAGTC 1 rrrCGCTCTTTCTACAGACCCl 1 1 ICI IGCTGTTATTTGCGCCTTTAl 1 1 1 I AGCAATCGGTT A 

TGGCTTGCCAG 1 1 1 11 CTGCTCTATGTGCl I M ATTGGGGGTAGGAAAATATTTCCACn 1 1 GTCAAAACGCCAGCA 
AATTTTTCACTGAAACTGGaCTGGACTGGGACTATGTTATTTCTCAAGAAAGCAAGCGTAAGC^ ' 
TTTCTTTGCCCTCTTTACGCAGGTCAAGGGAATTTCAAACAGCGrrAAGCGTCG 

AGGCTGTTCAGAAGGTGCCTGGCAAGATTTGGCAAAATCTCTATCTCCGrrCTTATCTGCGAAATCGCGACCT 
10 TGCTCTCAGTCTTCGTCTTCTCTTCCTTTCCTTGCTGGCGCAGO 1 1 I ri'ATCGAGCAAGCTTGGATTGCGACAGCAG 

TGGTAGTTCTCTrrAACTACCTCTTGCTCTTCCAGTTGCTGGCCCTCTATC^ 

CAACTCTTTCCGCTGGACAAGCGGCAAAAGGAAAAAGGCTTACAGGAGGTAGTTCGACGATTGACCAO I 1 1 IGIT 

TTACTTGTGGAATTA G TT G TTGGGTrGATTACCrrCCAAGAAAAACTAGCCCTTCTACCCTT 

TG UI 1 1 l ACTAGTCrrGTATTTGCCTTATCAGGTAAAACGTCAGATGCAGGACTAA 

15 

4158.1 

ATGAGAAAATCAATAGTATTAGCGGCAGATAATGCCTATCTTATTCCTTTAGAGACGACTATAAAGTCTGT^ 
ATCACAATAGAGATGTTGA t I t I i ATATTCTCAACAGTGATATAGCTCCTGAATGGTTTAAATTATTGGGGAGAAA 
AATGGAAGTTGTGAATTCTACAATTCGCAGTCTACACATTGATAAAGAA L ' I 1 m GAAAGCTATAAAACAGGACCT 
20 CATATAAATTATG L 1 ILl l ACTTTAGA l 1111 1 GCGACAGAAGTGGTTGAATCTGATAGGGTATTGTATCTGGATTC 

CGATATCATTGTAACTGGGGAACTAGCTACrri O i I 1 GAGATAGATCTCAAAGGATATTCAATTGGTGCTGTTGAT 
GATGTCTATGCCTATGAAGGACGAAAATCTGGATTTAATACTGGTATGTTACTAATGGATGTTGCAAAGTGGAAA^ 

aacattctattgtcaatagttrattggaattagcggccgagcagaatcaacttgttcatcttggggatc^ 
tttaaatatttattttgaggataattggctagccttagataaaacatataattatatggtggct 
25 accttgctcaagaatgtgaacgtctagatgacaatccacctacaattgttcactatgctagtcatgataaa ccttg 

GAATACATATAGTATATCTAGACTACGTGAATTATGGTGGGrrrATAGAGATrrGGATTGGTCAGAGATTGCl'l I I 

caacgttccgatttaaattattttgaaagaagcaatcagtctaaaaaacaagtgatgc^ 
gatataaaacatttagagtatttagtacaacggttacctgattggcattttcatttggct 
tgaggagctgacctctctatcacagtatacgaatctaacagtatatcaaaatgtattacatagtagaattgattgg 
30 ctattggacgattctatagtttatttagatattaatacaggtggagaggtttttaatgtacttaca^ 

aaagtggcaagaaaatcttcgcri rigatatcacacgtaaaagtatggatgatggactctatgacggtattt^ 
tgtggagagaccagatgatttagtggatagaatgaagaatatagagatagactaa 

4158.2 

35 ATGACTAAGATTTATTCGTCAATAGCAGTAAAAAAAGGACTATTTACCTCATTTCTACT(^^ 

GAACTCGTATTATTCTCC CITI I GTTGACCTAAATACTAAAGA i ri ri ' l AGGAGGTTCAACAGCCTATCTAGCCTTC 
TCAGCCGCCCTAACAGCTGGGAATCTAAGAACTTTATCAAI ill! ICTGTTGGATTATCCCCTTGGATGTCCGCCA 
TGATTTTATGGCAG ATb 1111 ClTl'n CTAAACGGTrGGGTTTAACATCTACGTCTATAGAAATACAAGATCGCCGT 

aaaatgtacctgaccttgctaattgctgtgattcaatccttggcagttagcttgagactgcca 

40 ATTCTGCAATATTGGTTGTTCTAATGAATaCAATATTGCTGATAGCAGGAACA I 11 1 UCJl i GTTTGGTTGTCAGAT 

TTAAATGCGAGTaTGGGGATTGGAGGTTCTATTGTAATCCTCCTATCCAGTATG Cj 1111 AAATATTCCTC AGGAT G 
TTTTGGAAACATTTCAGACAGTACACATTCCAACAGGGATTATTGTGTTACTTGCTTTATTAACC 
TATTTACTTGCCCTTATGTATCGAGCTCGCTATTTGGTTCCTGTTAATAAAATTGGCTTACACA^ 
CTATTCTTATCTCG AAATCATGTTGAATCCTGCAGGTGGGATCCCTTATATGTATGTG ATGAG ITl 1 C 1 1 AGTCTAC 

45 CAGCTTATTTGTTCATCTTGTTGGGATTTATTTTCCCTAATCATTCAGGGTTAGCGGC^^ 

GTTGGAAAGCCTTTGTGGGTCTATGTTTATATTTCGGTCTTA l 11 11 ATTTAGTATCA ri ' 1 ' 1 I G CI ' l ' l ' lG T TACGATG 
AATGGAGAAGAGATTGCAGACCGTATGAAAAAATCTGGAGAATACATTTATGGTATTTATCCAGGTGCGGATACT 
AGTCGATTTATTAj\TCGATTGGTCCTrCGTTTCTCAGTCATAGGTGGTCTCTTTAATCTGATTATG^ 
CATGCTTTrTGTTTTGTTTGATGAAAAGT^ 

50 TGATTTTTACGATTAGAGACGAGGTCAAGGCTTTAAGGCTAAATGAGACCTATAGACCTTTGA^ 

4158.3 

ATGTCCTCTCrrTCGGATCAAGAATTAGTAG^ 

TAGACGATATTTTGGTTGAAG Ll 1 1 I GCTGTGGTGCGTGAAGCAGATAAGCGGATTTTAGGGATGTTTCCTTATGA 
55 TGTTCAAGTCATGGGAGCTATTGTCATGCACTATGGAAATGTTGCTGaGATGAATACGGGGGAAGGTAAGACCTT 
GACACCTACCATGCCTGTCTATTTGAACG C i 1 1 1 I CAGGAGAAGGAGTGATGGTTGTGACTCCTAATGAGTATTTA 
TCAAAGCGTGATGCCGAGGAAATGGGTCAAGTTTATCGTTTTCTAGGATTGACCATTGGTGTACCATTTACGG 
A TCC AA AGAAG GAGATGAAAGCTGAAGAAAAGAAGCTTATCTATGCTTCGGATATCATCTACACAACCAATACTA 
ATTTAGGTTTTGATTATCTAAATGATAACCTAGCCTCGAATGAAGAAGGTAAG I 1 f 1 l ACGACCGnTAACTATGT 
60 GATTATTGATGAAATTGATGATATCTTGCrrGATAGTGCACAAACrCCTCTGATTATTGCGGGTTCT^ 

AGTCTAATTAC TATG CGATCATTGATACACTTGTAACAACCTTGGTCGAAGGAGAGGATTATATCTrrAAAGAGGA 

gaaacaggaggtttggctcactactaagggggccaagtctgctgagaatttcctagggattgataatttatac^ 
ggaagagcatgcgtcrrttgctcgtcatttggtttatgcgattcgagctcataagctctttact 
tatatcattcgtggaaatgagatggtactggttgataagggaacagggcgtctaatggaaatgactaaacttcaa 
65 ggaggtctccatcaggctattgaagccaaggaacatgtcaaattatctcctgagacgcgggctatggcctcgatc 
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acctatcagagtctttttaagatgtttaataagatatctggtatgacagcgacaggtaagct 

tttattgaaacitacaatatgtctctagtacgcattccaaccaatcgtccgagacaacgga 

ttgactatccagataatctatatatcactttacctgaaaaagtgtatgcatccttggagtacatcaag caat ^ 

TGCTAAGGGAAATCCTTTACTCCTITTTCTAGGCTCAGTrGAAATGTCTCAACTCT 
5 AAGGGATTGCCCATAATGTCCTAAATGCTAATAATGCGGCGCGTGAGGCTCAGATTATCTCCGAGTCAGGTCAGA 
TGGGGGCTGTGACAGTGGCTACCTCTATGGCAGGACGTGGTACGGATATCAAGCTTCGTAAAGGAGTCGCAGAGC 
TTGGGGGCTTGATTGTTATTCGGACTGAGCGGATGGAAAGTCAGCGGATCGACCTACAAATrCCTGGCCGTTCTGG 
TCGTCAGGGAGATCCTGGTATGAGTAAATTTTTTGTATCCITAGAGGATGATCTTATCAAGAAAT^ 
TGGGTGCATAAAAAGTACAAAGACTATO^GGTrCAAGATATGACTCAACCGGAAGTATTGAAAGGTCG^ 
10 CGGAAACTAGTCGAAAAGGCTCAGCATGCCAGTGATAGTGCTGGACGTTCAGCACGTCGTCAGACTCrrGGAGTAT 

GAGGATGTTCTTGTGGATATCATTGAGAGATATACAGAAGAGGTAGCGGCTGATCACTATGCTAGTCGTGAATTAT 
TGTTTCACTTTATTGTGACCAATATTAGTTTTCATGTTAAAGAG 

GCAGTTCGTAGCTTTATGAAGCAGGTGATTGATAAAGAACTTTCTGAAAAGAAAGAATTACTTAATCA^ 
15 TATATGAACAGTTTTTACGACrrrCACTGCTTAAAGCCATTGATGACAACTGGGTAGAGCAGGTAGAC^ 

ACAGCTATCCATGGCTATCGGTGGTCAATCTGCTAGTCAGAAAAATCCAATCGTAGAGTACTATCAAGAAGCCTA 
CGCGGGCrrrGAAGCTATGAAAGAACAGATTCATGCGGATATGGTGCGTAATCTCCTGATGGGGCTGGTTGAGGT 
CACTCCAAAAGGTGAAATCGTGACTCATTTTCCATAA 

20 4158.4 

ATGATAGGGACTTTCGCCGCTGCTCnTGTAGCTGTACTAGCAAATTTCATCGTCCCTATTGAAATTACCCCA^ 

GTGCCAATACTGAAATTGCACCACCAGATGGGATTGGGCAGGTTCTCACCAACCTC TTGCTC AAACTGGTTGACA 

ACCCAGTCAACGCCCTGCTTACTGCTAACTATATTAGAATCTTATCTTGGGCAGTCATTT^ 

AGAAGCCAGTAAAAATAGTCAAGAATTGCTAAAAACTATCGCTGACGTGACTTCTAAAATTGTCGAATGGATCA^ 
25 CAATCTGGCTCCATTTGGAATCCTTGGT CriOi 1 1 1 I AAAACCATTTCTGACAAGGGAGT CGGAA GCCTTGCCAAC 

TACGGTATTTTATTGGTTCTATTAGTAACGACTATGLl 1 li iOi 1 GCCCCTGTGGTCAACCC nTGATT GCCn Ci fC 
TTTATGAGACGCAATCCTTACCCTCTAGTTTGGAACTGCCTCCGTGTCAGCGCTGTGACAGCCTTT^ 
GTTCTGCGACTAACATTCCTGTCAACATGAAACTCTGCCATGACCTTGGACTCAACCCAGATACCTAT^ 
TATCCCACTCGGTTCTACTATCAATATGGCTGGAGTAGCGATTACCATTAACCTTTTGACCCITGCTGCAGT^ 
30 ACrCTTGGAATrCCTGTTGACTTTGCCACAGCCTTTGTCCTCACTG TGGTA GCAGCTATCTCATCCT 

AGGTATTGCCGGAGGTTCCCrC C ' i ' lCl l ATCCCAGTTGCrrGTAGCCl I I [ CGGTATTTCTAACGATATTGCCATAC 

AAATTGTTGGGGTTGGTTTTGTGATTGGTGTCATCCAAGACTCATGTGAAACAGCCCTT^ 

CCTCTTTACCGCCGTTGCCGAATACGCAGCAACCCGTAAAAAATAA 

35 4158-5 

ATGTCTATTAGCCAACGTACCACCAAGCTCATCTTAGCTACCTGTCTTGCCTGCCTGCTTGC TTATT ^ 
TTCGTCAGCAGTTTCGGCTGGAATTATCGCTCTCTTGAGCCTATCTGATACGCGTAGAAGTACTTTAAAACTGGCT 
CGCAATCGT C illVi T CTATGCTTCTAGCTCTGGCTATCGGTGTTCT AG CT 1 fl CACTTG AGCGG ATTTCATATCTG 
GAGTCTCGGCCTCTATCTGGCCTTCTACGTTCCTTTAGCCTACAAGATGGGCTGGGAAATTGGCATCACA CCAA GC 

40 ACTGTTTTGGTTAGCCATCTCTTGGTTCAAGAGTCAACCTCTCCAGACCTTCTAGTCAATGAA^ 

TATTGGTACAGGATTrGCCTTGCTTGTTAATCTCTATATGCCTTCACGAGAAGAGGAAATCCAGCACTACCA 
CTGGTGGAAGAAAAGTTAAAAGATATCCTCCAGCGCTTCAAATACTATTTATCCAGAGGAGACGGACGCAACCGA 
GCACAGCTGGTAGCAG AATTAG ACACG C 1111 G AAAG AAGCCCTCAG ACTGGTCTATTTGGATCACTCTG ACCACC 

tctttcaccagacagactaccatatccactactttgagatgagacagcgacaaagtcc tatcc tgagaaacatgg 
45 cccaacagartaacacttgtcaccttgccgccagtgaaagcctgatcttagcgcaactcttttc^^ 

tcaactgagccagaccaatcctgcttctgatttgctagatgaaattgaacgttatctggaagtcttccggaaccgc 

agtctgcccaagacaagagaagaatttgaaacccgcgccacccttcttcaact 

atccaagtaaaagttgatttttaccaaaaatatagacactaa 

50 4158.6 

ATGGAAATCATGTCGCTTGCGATTGCTG rriM I GCCGT CATCA TTGGTITAGTCATTGGATATGTCAGCATCTCAGC 
TAAGATGAAATCATCTCAGCAAGCTGCAGAGTTGATGCI I I IAAATGCTGAACAAGAAGCAACTAATTTACGTGG 

acaagctgagcgtgaagcggatttacn g ttaatgaagccaaacgtgaaagcaagtctcttaaaaaagaagcact 
attggaggccaaagaagaagccagaaaataccgtgaagaagtggacgctgaattcaaatcagaacgtcaag 

55 tcaaacaaatcgaaagtcgtttgacacagagagctactagccttgaccgtaaggacgacaatttgacgact 
aacaaacacttgaacaaaaagaacaaagtatttctgatagagcgaaaaaccttgatgcgcgtgaagagc^ 
aggaactcgaaagacaaaaagaagcagaactagagcgtattggtgcgctgtctcaggcagaagcacgagatatt 
atcttggctcagacagaggaaaacttgaccagggagattgccagtcgcattcgcgaagctgagcaagaggtcaag 
gaacgttctgacaaaatggccaaggacatcttggttcaagctatgcaacgtatcgctggtgaatatgtagcggag 

60 tcaacaaactcaacagttcatctgccagacgatactatgaagggacgcattattggtcgtgaaggtcgtaacatt 
cgtacctttgaaagtttgacagggctcgatgtgattatcgacgatacaccagaagtggtgaccttgtcacgat^ 
atccgattcgtcgtgagattgcccgtatgactatggaaatgttgctcaaagatggtcctatacatccagctcgtat 
cgaagagttggttgagaaaaaccgtcaagagattgacaataagattcgtgaatacggtgaggctgctgcctatga 
aattggtgcgccaaaccttcatccagacttgatgaagattatgggacgtttgcagttccgtacitcatatcgacaa 

65 aatgttttgcgccattcgattgaggttgctaagttggctggtatcatggcgagcgaacttggtgaaa^ 
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TTGCCCCTCCTGCTGGATTCCTTCACGATATCGGGAAAGCCArrGACCATGAGGTTGAAGGTACCCACCrrGA^ 
CGGTATGGAATTGGCCCGTAACTACAAGGAACCCCCAGTTGTGGTGAATACGATTGCTAGTCACCACGGAGATCT 
TGAAGCTGAGAGCCTGATAGCAGTTATCGTCGCTGCAGCAGATGCCTTGAGCGCAGCCCGTCCAGGTGCTCGTAG 
TGAGTCTCTTGAAAGCTACATCAAGCGTCTCCATGATTrGGAAGAAATTGCTAACGGCTTTGAAG 
5 AGCTTTGCCCTTCAAGCAGGACGTGAAATTCCTATCATGGTCAATCCAGGAAAAATCAAGGACGAC^ 

ATCTTGGCTCACAAAGTTCGTAAGAAAATTGAAAACAATCTCGATTATCCAGGAAATATCAAGCTAACCCT 
CGCGAGCTTCGTGCAGTAGA1TATGCTAAATAA 

4158.7 

10 ATGATGTTAAAACCCTCTATTGATACCTTGCTCGACAAGGTTCCTTCAAAATATTCACTCGTAAT^ 

AACGTGCCCACGAATTGGAAGCAGGTGCCCCAGCAACTCAAGGTTTCAAGTCTGAAAAATCAACrCTrCG 
TAGAAGAAATCGAATCAGGAAACCTTACAATTCACCCAGATCCAGAAGGAAAACGTGAAGCACTGCGTCGCCCTA 
TCCAAGAAGAAAAACGCCGCAAAGAAGAAGAAGAAAAGAAAATCAAAGAGCAAATTGCTAAAGAAAAAGA^ 
TGGTGAAAAAATTTAA 

15 

4161.1 

ATGTCAGCATATCAATTACCGACCGTATGGCAGGATGAAGCTAGTAATCAAGGAGCr nTA CGGGGCTAAACAGA 
CCAACAGCAGGTGCCCGTTTCGAACAAAACTTGCCAAAAGGAGAACAAGCTTTTCAGCTr^ 
CCAAATGGTGTGAAGGrrACTATCTTATTGGAAGAATTACTAGAAGCTGGrriTAAGGAAGCGCC^ 
20 ATAAGATTGCTATCATGGATGGGGATCAATTCGGATCAGACTTTGTGAAGCTCAATCCAAATTCCAAGArrCCA 
CTTATTCGACCAGTCAGGTACTGAAAACGTAAGAGTCTTTGAGTCTGCTCATATTCriCl U ACCTTGCTGAGAAA 
TTTGGAGC Cl i i 1 1 ACCAAGTAATCeTGTGGAAAAGCTAGAAGTTTTGAATTGGCTATTCTGGCAAGCAGG TGCAG 
CAC Ci 1 i I CrAGGTGGGGGATTTGGACA rriLU - I CAATTATGCTCCTGAAAAATTGGAATATCCTATTAACCO i i i 

acgatggaagtgaaacgccagttggatttattggataaggaattggctcagaaaccttatattgcaggcaatcac 
25 tatacgaytgcagatattgctatctggtcttggtatggacagttagttcaaggaaatctttaccaj^ggt^ 

aa 1 1 cll ggatgcctcaagttatcaaaatctagtaaaatgggcagaaaaaattgccaatcgtccagctgttaagc 
gtggcttggaagtaacttatacagaaattaaatag 

4161.2 

30 ttggcaagcrrcatcacttctatcatcatgttctatgtcggtttcgatgttcta^ 

cagtcgggaagaaacggtcattgatcctcttgctgcaactctaggaatcattrctgcagcgattatgtt t^ 
tatctctacaatactcgcctcagtaagaaatccaactccaatgcgcrgaaggcagctgctaaggacaatctt^ 
acgctgttacctcacttggaaccgccattgccatcctagctagtagtttcaattatccgattgtgg ataa ac^ 
tgctatcatcatcactttctttatcrrgaagactgcctatgatatcttcatcgagtct^ 

35 gcrrrgacgaccgcctgctcgaggactaccaaaaggctatcatggaaatrcccaaaatcagcaaggt c^ 

aaagaggtcgcacctacggtagcaacl\tctacctggatattacactagagatgaatcctgacttgtctgrr^ 

aagccatgaaatcgcggatcaggtccagtctatgctggaggagcgttttggcgtctttgataccgatgtccatatc 

gaaccagcacctatccctgaggatgaaattttagacaatgtctataaaaaattgcttatgcgtgaacaat^ 

ACCAAGGAAACCAACTAGAAGAACTCTTGACTGATGATTTTGTCTATATTCGCCAAGATGGAGAGCAGATGGATA 
40 AAGAGGCTrATAAGACCAAAAAAGAGTrAAATTCTGCTATCAAGGACATTCAAATTACTT^ 

CCAA ACTCATCTGCTATGAGTTAGATGGTATCATCCATACCAGTATCTGGCGTCGCCACGAAACCTGGCAAAATAT 
CTTTCATCAAGAAACCAAAAAAGAATAG 

4162.1 

45 atgacaattaaactagtagcaacggatatggacggaacct tccta gatgggaatggacgctttgatatggatcgt 
ctcaagtctctcrtggtttcctacaaggaaaaagg gattt actttg^^ 
tagaaaaattatttgctggtgttcctgatgacattattttcatcgcggaaaatggcagtt^ 
tcaggacttgtatgaagcgactatgtctcgtgacl ti l atctggcaacri i 1 gaaaagctg aaaacttcaccttat 
gtagatatcaataaactgctcttgacgggtaagaagggttcatatgttctagatacggttgatgag accta tttga 

50 aagtgagtcagcactataatgaaaatatccaaaaagtagcgagtttggaagatatcacagatgacatt^ 

ttacaaccaacttcacagaagaaacgctggaagatggggaggcttgggtaaacgaaaacgttcctggtcrraacg 
ccatgacaactgccrrrgaatccattgatarrgttctggactatgtcgataagggagtggccattgttgaat^ 
taaaaaacttggtatcacaatggatcaggtcatggcin ri ggag acaatcttaatgacitacatatgatgc^ 
gtgggacatcctgtagctcctgaaaatgcacgacctgaaattrragaattagcaaagactgtgattggtc^ 

55 aggaacgctccgttatagcttatatcgagggcttataa 

4162.2 

ATGGCAGATATAAAATTGATTGCATTGGACTTGGACGGGACCTTGCTGACTACTGATAAAAGGCTGACGGATan' 
ACCAAGGAAACCnTGCAAGCTGCGCGTGATCGTGGTATCAAGGTCGTATTGACAACTGGTCGTCCCTTAAAAGCC 

60 ATGGAl 1 ILl I ICTCCATGAGTTAGGGACTGACGGTCAGGAAGATGAGTATACCATTACl I 1 I AATGGTCGATTAG 

TTCAGAAAAATACAGGAGAAATCCTTGATAAAACAGT Cl 11 I CATATGATGATGTGGCACGTTTGTATGAAGAAAC 
AGAGAAATTATCaCTGCCTCTTGATGCCATCTCAGAAGGAACAGTTTATCAAATCCAATCGGACCAAGAAAGTCT 
TTATGCCAAATTCAATCCAGCTTTGACC 1 n g ri CCaGTGGACTTTGAAGACTTATCTAGTCAAATGACCTACAAC 
AAATGCGTGACTCCCTTTGCTCAAGAACCCTTGGATGCAGCCATTCAGAAGATTTCTCCAGAATTGrr^ 

65 ATGAAATCrrTAAATCACGTGAAATGTTGCTAGAATGGTCACCAAAGAATGTTCATAAAGCAACAGGTTTGGC/^ 
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AACTAATCACCCATCTTGGAATCGACCAAAGTCAAGTGATGGCTTGTGGTGACGACGCCAATGACCTCT 

TrOAATGCGCAGGTCnTGGTGTTGCTATGCAAAACGCTCTrCCTGAAGTAAAGGCAGCCGCAAATGTAG^ 

CGATGACCAACGATGAGGAAGCTGTCGCCTGGGCTATCGAAGAATATGTGCTAAAGGAGAACTAA 

5 4164.2 

ATGGAAACTTTACTTATTCTATTATTAATTGCCAATCTAGCTGGTCTCTTTCTGATT^ 

AGGAGAAACACTTAAGTAAGAGCTTGGAGGATCAGGCAGATCATTTGTCAGACC^GTTGGATTACCGCT^ 
AAGCCAGACAAGCCAGCCAGTTAGACCAAAAAGATTTGGAAGTGGTTGTCAGCGACCGTTTGCAAGAACT 
TTGAATTGCACCAAGGTCTGACCCAAGTCCCTCAAGAAATGACAGATAATCrCCTCCAAACTAGAGACAA 

10 ACCAACGTCTCCAAGCCTTGCAGGAATCAAATGAGCAACGTTTGGA^ 

TAGAAAAGACCTTGCAGACACGCTTACAGGCTTCCTTTGAGACAGTTTCTAj\ACAACT 
CCTTGGAGAAATGCAGACAGTTGCCCGTGATGTCGGAGCTCTTAACAAGGTTCTCTCTGGAACCAAGA^^ 
GATTCTGGGAGAj\TTGCAACrGGGGCAj\ATTATTGAAGACATCATGACACCTGCCCAGTACGAACGAGAATACCC 
AACGGTTGAAAACTCTAGTGAACGAGTGGAGTATGCCATCAAGTTACCCGGACAAGGCGACCAAGAATACGTCTA 

15 TCTGCCAATTGACrCTAAGTTTCCACTGGCAGATTATTACCGCrrGGAj\GAAGCCTATGAGACAGG^ 

GAGATTGAACGCTGTCGTAAGTCACTCCTAGCAAGCGTCAAGCGCTTTGCTAGGGATATTAGGAACAAGTA 
GCACCACCTCGGACGACCAATrTTGGA Ul 11 f G i ' I I (j 1 1 CCGACAGAAGGTCTCTACTCAGAAATCGTCCGCAATC 
CGGT Cll CI 1 i GATGATTrGAGACGGGAAGAACAGATTATTGTTGCAGGACCAAGTACCCTATCAGCCCUl'Cri AA 
CrCCCTATCAGTTGGTTTCAAGACCCTTAATATCCAAAAGAGTGCCGACC\TATCAGCAAGACTC^ 

20 AAGACCGAGTTTGGCAA Ci 1 1 lO GTGGTATTCTGGTCAAGGCACAAAAACATCrCCAACATGCCTCTGGCAATATrG 

ATGAATTATTAAACCGTCGTACCATAGCTATCGAGCGGACGCTCCGTCACATTGAGTTGTCAGAAGGTGAGCCTGC 
GCTTGATCTACTCCATTTTCAACAAAATGAGGAAGAATATGAAGATTAG 

4164.3 

25 ATGAAGATTAGTCACATGAAAAAAGATGAGTTATTTGAAGGCTTTTACCTAATCAAATCAGCTGACCT 
ACTCGAGCTGGGAAAAACTACCTAGCCTTTACCTTCCAAGATGATAGTGGCGAGATT^ 

gcccaaccrcataacattgaggcctttaccgcaggtaaggttgtccacatgaaaggacgccgagaagrttataac 

aatacccctcaagtcaatcaaattactctccgcctgcctcaagctggtgaacccaatgacccagctgatttc^ 

tcaagtcaccagttgatgtcaaggaaattcgtgactacatgtcgcaaatgatittcaaaattgaaaatcct 

30 gcaacggattgtccgaaatctctacaccaactatgataaggaattcractcctatccagctgccaagacc aacc ^ 
ccatgccttrgaaacgggcttggcctatcatacggcgaccatggtgcgtttggcagacgctattagcgaagtttat 
cctcagctcaataagagcctgctctatgcggggattatgttccatgacttagctaaggtcatcgagttgacggggc 
cagaccagac^gagtacacagtgcgaggtaatcttcttggacatatcgctctcattgatagcgaaattaccaaga 
cagttatggaactcggcatccatgataccaaggaagaagtcgttttgcttcgtcatgtcatcctcactcaccacgg 

35 cttgcttgagtatggaagcccagtccgtccacgcattatggaagcagagattatccatatgattgacaatcrggat 
gcaagcatgatgatgatgtcaacagctcttgctitggtggataaaggagagatgaccaataaaatctrccc^ 
gataatcgttccttcrataaaccagatttagattaa 

4166.2 

40 ATGAGTGAAAAAGCTAAAAAAGGGrTTAAGATGCCTTCATCTTACACCGTATTATTGATAATCATTGCT 
CAGTGCTAACTTGGTTTATCCCrGCGGGGGCCTTTATAGAAGGTATTTACGAGACTCAGCCTCAA^ 
GATTTGGGATGTCCTCATGGCACCGATTCCGGCTATGCTAGGTACTCATCCAGAGGAAGGTTCGCrCATTAAAGAA 
ACGAGCGCAGCGATTG ATGTAGC C i 1 L Tl CATCCTTATGGTTGGTGGTTTCCTTGGCATTG TCAA CAAAACTGGTG 
CTCrTGACGTAGGGATTGCCTCTATCGTGAAGAAGTATAAGGGCCGCGAAAAAATGTTAATTTTGGTACTGA^ 

45 rrrGTTTGCCCTCGGTGGTACAACTTATGGTATGGGTGAAGAAACAATGGCCTTCTATCCACTCCTTGTGC^^ 
ATGATCGCCGTTGGTTTTGATAGCCTGACTGGTGTTGCAATTATTTTGCTCGGTTCTCAAATCGGCT 
TACTCTGAATCCATTTGCGACAGGTATTGCTTCAGCGACTGCGGGAGTTGGTACAGGGGACGGTATCGTACTTCCT 
CTGATCrrCTGGGTTACCTTGACTGCTCTTAGTACTTGGTrrGTTTACCGTTATGCGGATi^ 
GACTAAGTCACTGGTTTATAGTACTCGCAAAGAAGATTTGAAACACTTTAACGTAGAAGAATCTTCATCT^ 

50 TCTACACTTAGCAGCAAACAAAAATCAGTTCT H ICl 1 ATTTGTGTTGACATTCaTCTTGATGGTATTGAGCTTCAT 

TCCATGGACAGACCTTGGCCTTACCA l rn 1 GATGACTrrAATACTTGGTTGACTGGTCTTCCAGTTATTGGTAATA 
TTGTCGGTTCATCTACTTCTGCACrAGGTACTTGGTACTTCCCAGAAGGCGCAATGCTCTTTGCC^^ 
CTGATTGGTGTTATTTATGGTCTTAAAGAAGATAACATTATCTCnTCCTTCATGAATGGTGCTGCTC 
TGTTGCCTTGATCGTAGCGATTGCTCGTGGTATTCAAGTTATCATGAACGACGGTATGATTACCGATACAATCCTC 

55 AACTGGGGTAAAGAAGGCTTGAGCGGTCTATCTTCACAAGTCTTTATCGTTGTAACTTATATCITCT 
GTCATTCTTGATCCCATCTTCATCTGGTCTTGCCAGCGCAACTATGGGTATCATGGC 
TCCACTTGGAGAATTTGTAAATGTCCGTCCTAGCTTGATTATCACTGCTTACCAATCTGCTTC^ 
TGATTGCACCAACATCTGGTATTGTGATGGGAGCTCTTGCACTTGGACGTATCAACATTGGTACTTGGTGGAAAT^ 
CATGGGCAAACTCGTAGTCGCTATTATTGTAGTGACCATCGCCC'i i LTl CTCCTTGGAACCTTCCTTCCATTCCTAT 

60 AA 

4166.3 

ATG AAAATAG ATATAACAAATCAAGTT AAAG ATG AA 11 ' 1 C 1' 1 AT ATCATTAAAAACCTTG ATTTCCTATCCTT CAC 
TACTCAATGAAGGAGAAAATGGAACACCTTTTGGACAAGCAATCCAAGATGTCCTAGAAAAAACTTTAGA 

65 gtcgagacataggtttcactacctatcttgaccctaaaggttattacggatatccagaaatcggtcagggagcag 
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AGCTTCTGGCCATTCTCTGTCATTTGGATG ri GT TCCATCAGGTGATGAACCAGATTGCCAGACACCGCCATTTGA 

AGCAACTATCjVAAGACGGCTGGCTATTCGGACGTGGTGTCCAAGATGATAAAGGCCCTTCGCTCGCAGCT 

TGCAGTAAAAAGCTTGCTGGACeAAGCTATTCAGTTCAAAAAGCGCGTACGCm 

AACCCTCTGGCGCTGCATGGCACGCTACAATACCATCGAAGAACAGGCCAGTATGGGCTTTGCACCTGACTCATC 
5 Vi'li CCTCTG ACCTATGCTG AAAAAGGGCTTCT ACAGGTCAAACTTC ATGGCCCTGG ATCGG ATCAACTAG AGCTT 

gaagtaggaggcgcci IIAACGTTGTACCAGACAAGGCCAACTACCAAGGTCTCCTCTATGAACAGGTTTGTAAC 
GGTCTCAAAGAAGCTGGTTATGATTACCAAACCACTGAACAAACCGTAACGGTTCTCGGAGTGCCAAAGCATC 
AAGGATGCTAGTCAAGGTATCAATGCTGTCATCCGACTAGCTACCATTCTTGCTCCTCTCCA^ 
TCA Om ICl I GCAACACAAGCAGGTCAAGACGGOVCAGGAAGACAAATCTTTGGTGATATAGCAGATGAACCTT 

10 CTGGTCACCTATCCTTTAATGTCGCAGGTCTCATGATCAATCATGAACGTTCTGAAATCCGTATTGAC^ 
TCCTGTCTTAGCTGACAAGGAAGAACTAGTAGAGTTGCTrACAAGATGTGCACAAAACTACCAACT 
AGAGTTTGACTATCTAGCGCCTCTATACGTCGCAGAAGACAGTAAACTCCTTAGCACACTGATGCAAATCTACCA 
AGAAAAGACTGGCGATAACACTCCTGCTATTTCATCCGGTGCTGCCACri I I GCTCGCACCATGCCAAATTCTGTA 
GCCTTCGGCGCCTTATTCCCAGGAGCGAAGCAGACAGAACATCAGGCAAATGAATGTGCCGTTCTAGAAGATTTG 

1 5 TACCGTGCTATGG ATATTTATGCCGAAGCCGTCTATCGACTTGCAACTTAA 

4169,1 

ATGTCT AATTCATTTGTCAAGTTGTT AGTCTCTC AATT ATTTG CAAATTTAGCAG AT A 1 " 1 ' 1 I CI " I " I AG AGTAACAAT 
CATTGCTAACATATACATTATrrCAAAATCAGTAATTCCCACATCACTAGTrCCTATCrrAATAGGA^ 

20 TTGTrGCCAGT L i 1 1 1 AGTTCCGTTCGTTACTAAAAGGTTAGCGCTAAATAGG O 1 1 1 1 ATCTTTATCTCAATTrGG A 

AAGACTATATTATTGGCGATACTGCTAGGAATGTTTACCGTAATGCAATCCGTAGCGCCTTTGGTGACCTATCTAT 
TTGTTGTTGCAATTTCCATACTAGATCG I'l'l'l GCAGCACCCGTTTCCTATGCTATTGTGCCACGCTATGCGACCGAT 
TTGGGTAAGGCTAATTCAGCCTTATCAATGACTGGTGAAGCTGTTCAATTGATAGGTTGGGGAT^ 
TGTITGCAACAATTGGTCrGTTACCTACCACGTGTATCAATTTAGTCTTGTATATCATTTCTAGCT^ 

25 TTTCrrCCTAACGCTGAAGTGGAGGTCTTAGAGTCAGAAACTAATCTTGAAATTTTGCT^ 

TTGCTAG'AAATCCTAGATTAAGA Ci i 1 1 I GTATCAGCAAATTTATTGGAAA l 1 1 I ' l I CAAATACGATTTGGGTTTCT 
TCCATTATACTTG I Hi I'G 1 AACGGaGTTATTAAATAAAACGG AAAGTTACTGGGGATATTCTAATACAGCATACT 
CTATTGGTATTATAATTAGTGGCTTAATTGCl 1 11 AGGCTATCTGAAAACTTCCTTGCTGCTAAATG GGAA CCCCA 
ATTATTC ACCCCA AATCTAAAAACCATCCAGAATC CTTG CCTTAGCTTAGATCCTGGATGG 1 1 1 LI 11 1 1 iCACCCA 

30 ATGGGTGTTTTTTACTAGACAAAAAAGAGTTTCCCCTTTAT^ 

aaactcacatgaacagtttaccaaatcatcacttccaaaacaagtl 1 1 it 1 ACCAACTATCTTTCGATGGAGGTCA 

TTTAACCCAGTATGGTGGTCrrAT Li 1111 I CAGGAA ClTm ' l CCCAGTTGAAACTAAAAGAGCGGATTTCTAAGT 
ATTTAGTAACGAATGACCAACGCCGCTACTGTCGTTATTCGGATTCAGATATCCTTGTCCAGTTCCTCr^ 
TTAACAGGTTATGGAACGGACTATGCTTGTAAAGAATTGTCAGCTGATGCCTACTT 
35 TCCAAAATTGTTGGAAGGAGGGCAGCTrGCTTCACAGCCAACCTTATCCC Gl 11 ILl 11 CCAGAACTGACGAGGAA 

ACAGTCCATAGTTTGCGATGCCTCAACCTTGAATTGGTCGAATT Li 1 I M ACACTTTCACCAGCTAAACCAACTCA 

ttgtagatatcgattctacccatttcacaacttatggcaagcaagaaggtgttgcttataacgcccact 
tcatggctatcatcctctttatgctttcgaggggaagacaggttattgtttcaatgcccagcttcgtcctggtaat^ 
gttatt gttct gaagaggcagacagcntatcacacctul 1 1 1 agaacggtttaatcaacttctcntcgaatgga 
40 tagtggctttgcgaccccaaaattatacgatttaattgaaaaaacagggcaatactacctcataa^ 

aaatactgttctgagccgtctrggagacctttccctcccrrgcccacaggatgaggacrtaaccatcttgccccac 
tccgcctactcagaaactcrrctatcaagcaggatcttggtcgcacaagcgtcgtgtctgccagttctctg aacga a 

AA^AAGGAAACTTGTTCTACGATGTTATTTCTCTCGTTACAAATATGACGA 

GCTTTATCGTGGACGTGGTCAAGCCGAGAATTTCATCAAGGAGATGAAGGAGGGArrrri 1 GGCGATAAAACGGA 
45 TAGTTCAACCTTAATCAAAAACGAAGTTCGTATGATGATGAGCTGTATCGCCTACAATCTCTATCIU-rilCTCA^ 
CATCTAGCrGGAGGTGA(nTCCAAACTTTAACAATCAAACGCTTCCGCCAT C,l 1 1 ' l ' lLl 1 CACGTGGTGGGAAAAT 
GTGTTCGAACAGGACGCAAGCAGCTCCrCAAATTGTCTAGTCTCTATGCCTATTCCGAATTGTT^ 
TTCTAGGATTAGAAAAGTCAACCTGAATCTTCCTGTTCCTTATGAACCACCTAGAAGAAAAGCGTCGTTAAT^ 
CATTAA 

50 

4169.3 

ATGATGGAGil 1 1 1 1 CAACAGCrTCCTCATTTAGAGCCATATGGCAATCCTCACTA lll 1 G 1 i 1 ATGTGATTGCTGC 
AACCrrGCCCATCTTTATACGTCTCTTTTTCAAGAAACGCTTTGCCTGGTATGA^ 
TTGTCACCATGTTGGTGGGTGGAAAGACCAATCAACTAGCTGCCTT^ 
55 CCT GCTrrrCT ACAA GCATTATCGAAAAAGCAAGGATGGCAAGTGGGTCTTCTACT^ I UCTGTCCCTA 

CTTCCG ATTA TCl T 1 0"1 CAAGGTGCAACCAGCTATCAATGG AACGCAGTCTTTGCTTGGGTTCTTGGGAA 1 1 1 C 1 lA 
CCTGACCTTTCGTTCGGTTGGAATTGTpVTCGAGCT 

CTCCGTTTCCTTCTCTTCATGCCAACTTTCTCGACTGGTCCAATCGATCGCm 

GGCTATTCCTGAGCGAGA TGAGT TCATGGATATGCTGGATGAATCTGTCCGCTATATCATGTGGGGCl'l 1 riGTAT 

60 AAGTTTATCCTAGCTCATGTrTTAGGAGAGACCTTACTACCTCCTCTGAAGy^ 

TCT TTAA TCTCTATGCCTTGGCAGTTATGTATA C 1 1 1 I GGTCTGGAACT Cl 1 LTl ' l GACTTTGCAGGTTATTCTATGT 
TTGCTTTGGrcATCTCAAACTTGATGGGAATCCGTAGCCCTATCAACTTTAACAAG 
AAAGGAGTnTGGAATCGCT GGCA TATGAGTCTGTCCTTCTGGTTCCGTGACTTTGTCTTTATC 
GTGTTAACCAGAAAGAAAGTCTTTAAAAATCGTAATGTAACCTCAAGCATGGCCTACATTGTAAA 

65 TGGGATTTTCGCATGGTGTGACCTGGTACTATATCGCCTATGGACTCTTTCATGGACTAGGCTTGGTCATCAATGA 
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TGCCTGGCTTTCGCAAGAAAAAAACGCTCAATAAGGAACGGAAAAAAGCAGGGAAGGCTCCCCTACCTCAG^ 

GCTGGATTCAGTrGCTrGGCATCGTrGTCACTTTCCATGTTGTCATGTTGTCATTCTT^^ 

ATAATCTATGGTTTAAAAAATAA 

5 4169.4 

ATGCTTAAACGCTTATGGATGATCTTCGGACCGGTCTTGATCGCTG O t i lOll G Oi I ' i I I CTGCTC ATm criTi'AT 
CCTACTCAGATGCATCATAATCTAGGAGCTGAAAACCCTTCAGCAGTGGCTACTACTATCGATAGl i i lAAGOAGC 
GAAGTCAAAAAGTCAGAGCACTATCTGATCCAAATCTGC Gi I I ' l GTTCC Ci ' iCI 1 I GGCTCTAGTGAATGGCTTCG 
TTTTGACGGTGCTCATCCTGCGGTATTACCTGAGAAATACAATCGTTCCTACCGTCCTTATCl l l I AGGACAGGGG 
10 GGAGCTGCATCGCTTAACCAATATTTTGGAATGCAACAGATGTTACCACAGCTGGAGAATAAACAA(^ 
GTTATCTCACCTCAGTCGTTCAGTAAAAATCCCTATGATCCAGCAGCCTTCCAGC^ 

TGACTAOl 11 ICTGAAACATCAATCTGGGGATCAGGCTAGTCAATATGCAGCGACTCGCTTACTGCAACAGTTCCC 
AAACGTAGCTATGA AGGA CCTGGTTCAGAAGTTGG CAAGTAA AGAAGAATTGTCGACAGCAGACAATGAAATGA 
TGAATTATTGGCTCGTTTTAATGAACGCCAAGCTTC CI i i I 1 I GGTCA Oi 1 Tl CGGTTAGAGGCTATGTTAACTACG 
15 ATAAGCATGTAGCTAAGTATTTAAAAATCTTGCCAGACCACjI 1 1 ICriATCAGGCAATAGAAGATGTTGTCAAAGC 

AGATGCTGAAAAAAATACTTCCAATAATGA GATG GGAATGGAAAATTATTTCTATAATGAGCAGATCAAGAAGGA 
TTTG AAG AAATTAAAGG ATTCTCAG AAAAGC r I " 1 ACCTATCTCAAGTCGCC AG AGTATAATG ACTTGCAGTTGGTT 
TTAACACAGTTTTCTAAATCTAAGGTAAACCCGATTTTTATCATTCCACCTGTTA^^ 

CTGGTCTACGAGAGGATATGTACCAACAAACGCTGCAG AAGATTCCCTACCAGTTAGAAAGTCAAG U I I i ' l ACCA 
20 ATATAGCAGATTTTTCTAAGGACGGCGGGGAGCCTTTCTTTATGAAGGACACCATTCACCTTGG^ 

GTTGGCl I 1 IGACAAGGCAGTTGATCCTTTCCrATCCAATCCCACACCAGCTCCGACTrACCATCTGAATGAGCGC 
TTTTTCAGCAAAGATTGCGCGACTTATGATGGAGATGTCAAAGAATTTCAATAG 

4169.6 

25 ATCGAGAAAAACCTCAAGGCTTTGAAACAAACAACAGACCAAGAAGCCCCAGCAATTGAACCTGi^^ 

GGATACCAAGACA GTCC AA AATGG TTACTTCGAGGATGCAGCTCTCAAGGACCGCACCTTGAGTGACTATGCAGG 
TAACTGGCAATCACTTTATCCTTTCCTTG AAGACGGCACU 1 1 i GACCAAGTCnTGACTACAAGGCTAAGTTGACT 
GGTAAGATGACCCAGGCTGAGTACAAGGCrrACTATACAAAAGGCTATCATACAGATGTGACTAAGATTAACA^ 
ACTGATAATACrATGGAATTTGTTCj\AGGTGGACAAAGCAAGAAATACACTTACAAG^ 

30 TTGACTT ACAA GAAAGGCAATCCTGGCGTGCGlMU'CCTCTTTGAAGCCACAGATGCTGACGCrGGACAATTC 

ATGTTCAGT TTAGT GACCACAATGTTGCCCCAGTTAAGGCAGAACATTTCCATAT CriCl 1 I GGAGGCACAAGCCA 
AGAAGC CCT CTTTGAAGAAATGGACAACTGGCCAACCTACTACCCAGATAACCTATCTGGCCAAGAAATCGCCCA 
AG AAATGTTCG CGCATTG A 

35 4170.3 

ATGAAAGATG GTCAT TTGCTAGCCCATCATATTCGT TTGTT GAATGGGCGGA i 1 1 1 1 CAAAAGTTACTG AGTC AAG 
ATCCTGAGGCTCTTTATAGGGGTGAACAGGGCAAGATTTTAGCGGTTTTATGGAATAGTGAAACTGGCT 
CTGCGACAGATATCGCGCrrGCGACTGGACTTGCGAATAATACGCTGACGACTATGATAAAAAAGCTAGAGGAA^ 
AAAAGCTTGTAATTGTTAGTCCGTGTGGAAAAGACAAGCGTAAGAAGTATTTA Cil 11 1 AACGGAGTrAGGCAAGT 
40 CCCAGAA AGAA GTGGGGCATCGTCTCAGTCAGAAATTGGATACTAT Cl 1 11 ACAAAGGA ITI ' I CAGAGGAAGAAA 

_ TTCACCAATTTGAAGGTTTTCAAGAAAGAATTTTGGCGAATCTGAAAGAGAAGGGAAATGAGGTT^ 

4170.4 

a tgact aatttaattgcaacttttcaggatcgttttagtgattggttgacagctct 
45 gcttttgacci 1 0 11 actagctattttgcttgcgattcccttggctg 1 11 i ici 1 cgctatcatgagaagctggccg 
actggctcttgcagattgcaggtattttccagaccatcccgtctctggccttgttggggctc^^ 
ggaattgggaccttgccggctttgacagctctagtgatttatgcgattttccctattttgc^^ 

GGCTGAAGGGAATTGATCCGAACCTGCAAGAGCCTCGGATTGCCTTTGGGATGACCAGATGGGAACGTCTCA^ 
AATTTGAAATTCCACTCGCCATGCCTGTTATCATGTCTGGGATTCGGACGGCACCT O l I 1 1 GATTATCGGTACGGC 

50 AAC CTTGG CGGCCTTGA TTGGT CCAGGGGGACTAGGTTCCTTT ATT L l 1 I ' l GGGAATTGACCGTAATAATGCCAGT 

TTGArri'lGATTGGG GCAC l'l IL'l'l CTGCA GTGCTAGCC ATTGC CTTTAACTTCCTACrAAAAGTGATGGAAAAAG 
C AAAATT ACGG ACGATTTTCTCAGGTTrTGCCTTGGTGGCTTTATTACTGGGTCT 1 1 U G 

GTTCAAAAAGAGAAGGAAAACTTGGTTATTGCTGGGAAAATAGGTCCAGAACCAGAAATTTTGGCC>^ 
AAGTTGCTGATTGAAGAAAATACCAGCATGACTGCGACTCTTAAACCGAATTTTGGGAAGACAAGCTTCCTT^ 

5 5 AAGCTCTG AAAAAAGGCG ATATTG ACATCTATCCTG AATTTACTCGTACGGTGACTGAAAGTTTGC^ 

ACCCAAGGTGAGTCATGAj\CC>GAACAGGTTTATCAGGTGGCGCGTGATGGCATTCCTAAGCAGGATCATCTAGC 
CTATCTCAAACCC^TGTCTTATCAAAACACCTATGCTGTAGCTGTrCCGAAAAAGATTGCT 

aagaccatttcagacttgaaaaaagtggaagggcagttgaaggcaggttttacactcgagtttaa^ 
gatggaaataagggcttgcaatcaatgtatggtctcaatctcaatgtagcgaccattgagccagcccttcgct 
60 aggctattcagtcaggggatattcaaatcacggatgcctattcgactcatgcggaattggagccttatgatttaca 
cgtcttcgaagatgacaagcaactcttcccaccttatcaaggggctccactcatgaaagaagctc^ 
acacccagagttggaaagagttcntaatacattggctggtaagattacagaaagccagatgagccagctcaacta 
ccaagtcggtgttgaaggcaagtcagcaaagc^j\gtagccaagcagtttctccaagaac 
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4170.5 

ATGATGCATACTTATTTGC^AAAGAAAATTGAAAATATCAAAACAACCCTAGGTGA-AATCT 
CGTATGCTTGCGGCTATGGCTGATTTAGGATTTTCACGAACTATGAAGGCTATCTGGGATGACCTCT^ 
GTA Oi i \ i GCCCAGTGCATTrATTTCCTC Gn i i AGGAAGTTTTCCTCTCTGGCTGGAGTTGGTTTACGAACATCGT 
5 ATTGTTGACTGGATTGGGATGATTTGTAGCTTGACAGGGATTATCTGTGTAATCTTTGTA TCGCAA GG^ 
GTAATTAT C ' rrrri GGCTTGATTAACTCTCTTATTTACCTrATTTTGGCCCT 
CTGACGACAOTTACTTCACAGTCATGCAGCCAATTGGACTTCTAGTTTGGATT TAT^ 

AAAAGCAGGAGTTTGTCGCGCGTAAACTGGACGCCAAGGGCTGGACAAAGTATCTTTCCATTAGTGTGC^^ 
GGTrGGCCTTTGGCTTC:ATrrATC:j\GTCTATTCGTGCCAATCCTCCCTATCGTGAT^ 
10 TGGGGTAGGGCAAATCCTCATGACAGCTGTTTACCGTGAACAGTGGATATTCTGGGCGGCTACC^ 
ATCTATCTCTGGTGGGGAGAAAGCCTGCAAATTC^AGGGAAATATCTAATTTATCTCATTAAC^ 
GGTATCAATGGAGCAAGGCAGCTAAGCAGAATACTGATTTACTTAACTAG 

4170.6 

15 ATGAGAAATATGAAGGCAAAATATGCTGTTTGGGTGGCrri ri I C'l'l AAATTTGACTTATGCCATTGTTGAGTTTAT 

TGCAGGTGGAGTATTTGGTTCTAGCGCT Cji ILi i CCTGACTCTGTGCATGACTTGGGAGATGCGATTG CAATT GGA 
ATATCAGCTTTTCTAGAAACAATCTCCAATCGTGAAGAAGACAATCAGTACAC CTTGG GCTATAAGC^ 
TGCTAGGAGCCTTGGTAACAGCTGTGATTCTCGTAACGGCCTCTGTTCTAGTCATTTTGGAAAATGTCACG 
TTTGCATCCGCAACCAGTCAATGATGAGGGGATTCTCTGGTTAGGAATTATTGCGAT^ 
20 AGTCTGGTGGTTGGTAAGGGAAAGACAAAGAATGAGTCTATTCTGAGTCTGCATTTTCTGGA AGATA CGCTA 

TGGGTAGCTGTTATCCTGATGGCGATT CjI ILl I CGATTTACGGACTGGTATATCC TAGA TCCTCTTTTGTCCCTTGT 
CA' rn ' L ' l I ' lLTl l ATTCTTTCAAAAGCCCTTCCACGrrrriGCTCTACACTCAAGArri iCl i GG ATGCTGTGCCAG 
AAGGTCTTGATATCAAGCAAGTAAAGAGTGGCCTGGAGCGATTGGACAATGTGGCCAGCCTTAATCAGCrTAA^^ 
TCTGGACTATCGATGCTTTCGAAAAAAATGCCATTGTCCA 
25 TAAAGAGTCTATTCGAATTTTCCTAAAAGATTGTGGTTTTCj^AAATAT^ 

ACTCACdAAACCCATAAGCGAAAGGTGTGTGACTTGGAACGGAGTTATGAGCATCAACATTAG 

4170.8 

ATGATTGAATACAAAAATGTAGCACTGCGCTACACAGAAAAGGATG I Ci 1 GAGAGATGTCAACTTACAGAT TGAG 
30 GATGGGGAATTTATGGTTTTAGTAGGGCCTTCTGGGTCAGGTAAGACGACCATGCTCAAGATGATTAACCG TCT^ 
TGGAACCAACTGATGGAAATATTTATATCGATGGGAAGCGCATCAAAGACTATGATGAGCGTGAACTTCGTC^ 
CTACTGGTTATGTTTTACAGGCTATTGCTCrrrriCCAAATCTAACA GTTGCG GAAAATATT 

ATGAAGGGGTGGAGCAAGGAAGAAATTACGAAGAAAACAGAAGAGCt'riUGGCTAAGGTTGGrri'ACCAGTAGCC 
GAGTATGGGCATCGCTTACCTAGTGAATTATCTGGTGGAGAACAGCAACGGGTCGGTATTGTCCGAGCTATGATTG 
35 GTCAGCCCAAGATTTTCCTCATGGATG AACCCrri 1 CGGCCTTGG ATGCTATTTCGAGAAAACAGTTGCAGGTTCT 

GACAAAAGAATTGCATAAAGAGTTrGGGATGACAACCA l i 1 1 IGTAACCCATGATACGGATG AAGC CTTGAAGTT 
GGCGGACCGTATTGCTGTCTTGCAGGATGGAGAAATTCGCCAGGTAGCGAATCCCGAGACAATTTTAAAAGCGCC 
TGCAACAGACTTTGTAGCAGACTTGTTTGGAGGTAGTGTTCATGACTAA 

40 4171.1 

atgtcagcagttgctatttcagctatgaccaaggttatgcaagaaacccacggaaatccttctagtattcatgct 
atgctcgtcaagctggcaaactcttgcgagaagcccgtcaggaactagcccagttactaaggacaaaacctcaac 
atat cl 1 rn cacntctggtgggactgaaggcaataatactaccatcattggctacrg 1 ci i cgtcaccaag aaca 
aggaaaacatatcatcacaactgccatcgagcaccatgctgtccttgaaacaattgattacttgg ttca acac^ 

45 gggtttgaagcaaccattatccagccagaaaatcaagaaatcacagcccagcaaattcaaaaggctttacgtgac 
gatacgattttggtttctaccatgtttgtcaataatgagacaggaaacctactgcccatcgctgaaat^ 
tactcaagcaacaccctgctgcctatcatgttgatccagttcaggctattggtaaaatcccaattcattcagaaga 
attgggcattgattttctcactgcttctgcccacaaattccatggtcctaagggaatcggttttct 
gcatggactttgattcctatctacatggcggagaccaggaacagaaaaaacgtgcaggaactgaaaatctgcctg 

50 ccattgtaggcatggttgcagccctaaaagaagacctagaaaaacaagaagaacattttcaacatgtacaaaat^ 
tagaaactgcctttctggcagagctggagggcattcagtattaccrgaatagaggaaaacatcatctccc^ 
tctcaatattggatttcctggtcagaaaaatgaccrcttactccttcggctagatt^ 

ctgg ctcag cctgtactgcaggcgttgtcc aatccagccatg i ' i c m g aagccatgtatggcgcaaattcag aacg 
cttgaaggaatcccttcgcatcagtttgtcgccacaaaatacccttgaagacctacaaaccctcgcaaa^ 
55 aaagaaattatcggaggttag 



60 4172.1 

atgttattcaaattatctaaggaaaaaatagagctaggcitatctcgtttatcgccagcccgtcgta ri 1 rrriGA 

GTTTTGCCTrGGTCATTTrACTAGGCTCT L i I C I ' 1 11 GAGCTTCCCCTTTGTCCAAGTTG AAAGCTCACG AGCGACT 
TATTTTGATCAT C T i 1 I CACTGCTGTCTCTGCAGTCTGTGTGACGGGTCTCTCAACCCTTCCACTAGCTCACACCTA 
TAATATCTGGGGTCAAATAATCTGTTTGCTCTTGATTCAGATCGGTGGTCTAGGGCTCATGACCTT^ 
65 TCTAT ATCC AG AGC AAGCAAAAGCTT ACTCTTCGTAGCCGTGCAACT ATTC AGG AT AG I ' l" I" I AGTTATGG AC AAAC 



TCGATCnTGAGAAAGTTTCTCTATTCTA i 1 I t f CTCACGAC H 1 i 1 i GGTTGAG AGCTTGGG AGCTATTTTGCTTA 
G i ' i I I CGCCTrATTCCrCAACTTGGCTGGGGACGTGGT Crrrrr AGTTCCA 1 TlTi CTAGCGATCTCAGCCTTCTGT 
AATGCCCOl 1 1 IGATAATTTAGGGAGCACCAGTTTATTTGCI 1 I 1 CaGACCGATTTACTGGTCAATCTGGTGATTGC 
AGGCTTGATTA-TTACAGGCGGCCTTG Oi 1 t 1 ATGCTCTGGTTTGATTTGGCTGGTCATCTAGGAAGAAAGAAAAAA 

5 ggacgtctgcactttcatacgaagcttgtactattattgactatagg i not lOi L ATTTGGAACA GCAACTA CTCT 

CinUCri GAGTGGAACAATGCTGGAACGATTGGCAj^TCTCCCTGTTGCCGATAAG Urri I AGT TAGCl 11 i i ICAA 
ACAGTGACGATGCGAACAGCTGG CM ri 'CTACGATAGATTATACrCAGGCTCATCCTGTGACT CTTTTG ArrTATA 
TCTTACACATGTTTCTAGGTGGGGCACCTGGAGGAACAGCTGGGGGACTCAAGATTACGACAl 1111 IGTCCTCTT 
GGTCTTTGCACGAAGTGAGCTTCTAGGCTTGCCTCATGCCAATGTTGCGAGACGAACGATCGCCCCGCGAACGGT^ 
10 CAAAAATCCTTT AGTG TCTTTATTAT Ll 1 1 1 IG ATGAG Cl ILl I GATAGGATTGATTCTGCTAGGGATAACAGCCAA 

aggcaatcctccctttatccacctcgtattrgaaaccatttcagctcrragtacagttggtgtaacggc^^ 
actcctgaccttgggaaattggctctcagtgttatcatgccacttatgtttatggg 

ACGAATTCGTCCCTTGACL 1 1 0 ll'l GTTAGCTTGCCAGATTACCATCCAGAAAAGAAAGATATGATTCACTATATG 
AAAGCACATATTAGTATTGGTTAA 

15 

4172.2 

ATGTCAGATCGTACGATTGGAATTTTGGGCTTGGGAAI 1 1 1 ICGGAGCAGTGTCCTAGCTGCCCTAGCCAAGCAGG 
ATATGAATATTATCGCTATTGATGACCACGCAGAGCGCATCAATCAGTTTGAGCCAGTTTTGGCGCGTGGAGTGAT 

tggtgacatcacagatgaagaattattgagatcagcagggattgatacctgcgatacccttgtagtcgcgacagg 
20 tgaaaatctggagtcgagtgtgcttccggttatgcactgtaagagtttgggggtaccgactgttattgctaagg^ 
aaaactcagaccgctaagaaactgctagaaaagattggagctgactcggttatctcgccagagtatgaaatgggg 
cagtctctagcacagaccattc14"riccataatagtgttgatgtctttcagttggataaaaatgtgtctatcgtgg 

AGATGAAAATTCCTCAGTCTTGGGCAGGTCAj^AGTCTGAGTAAATTAGACCTCCGTGGCAAATACAATCTGAATA 
TTTTGGGTTTCCGAGAGCAGGAAAATTCCCCATTGGATGTTGAATTTGGACCAGATGACCTCTTGAAAGCAGAT 
25 CTATATT7TGGCAGTCATCAACAACCAGTATTTGGATACCCTAGTACCATTGAATTCGTAA 

4172.3 

atgaagttattgtctatcgcaatttctagctataatgcagcagcctatcttcattactgtgtggagtcgctagtga 
ttggtggtgagcaagttgggattttgattatcaatgacgggtctcaggatcagactcaggaaatcgctgagtgtt^ 

30 agctagcaagtatcctaatatcgttagagccatctatcaggaaaataaatgccatggcggtgcggtcaatcctgg 
crrggtagaggcttctgggcgctattttaaagtagttgacagtgatgactgggtggatcctcgtgcctact^ 
attcntgaaaccttgcaggaacttgagagcaaaggtcaagaggtggatgtctttgtgaccaattttgt 
aggaagggcagtctcgtaagaagagtatgagttacgattcagtcttgcctgttcggcagatttttggctgggacca 
ggtcggaaatttctccaaaggccagtataccatgatgcactcgctgatttatcggacagatttgtt 

35 cagttctaa 

4172.4 

ATGAAATTCAATC CAAA TCAAAGATATACTCCTTGGTCTATTCGCCCTCTCACTGTCGGTGTTGCCTCAGrT G TTG 
TGGCTACTGGLl ICl 1 1 GTCCTAGTTGGTCAGCCAAGTTCTGTACGTGCCGATGGGCTCAATCCAACCCCaGGTCA 
40 AGTCTTACCTGAAGAGACATCGGGAACGAAAGAGGGTGACTTATCAGAAAAj\CCAGGAGACACCGTTCTCACTCA 

agcgaaacctgagggcgttactgg aaatac gaattcacttccgacacctacagaaagaactgaagtgagcgagga 

AACAAGCCCTTCTAGTCTGGATACACTTTTTGAAAAAGATGAAGAAGCTCAAAj\AAATCCAGA 
CTTAAAAGAAACTGTAGATACAGCTGATGTGGATGGGACACAAGCAJKGTCCAGCAGAAACTaCTCCTGAACAAGT 
AAAAG GTGG AGTGAAAGAAAATaCAAAAGaCAGCATCGATGTTCCTGCTCCTTATCTTGAj\AAAGCTG^ 
45 AGGTCCTTTCACTGCCGGTGTAAACCAAGTAATTCCTTATGAACTATTCGCTGGTGATGGTATGTTAACT 

ttactaaaagcttcggataatgctccttggtctgacaatggtactgctaaaaatcctgct^ 
gattaacaaaagggaaatacttctatgaagtagacttaaatggcaatactgttggtaaacaaggtcaagct^ 

TTGATCAACTTCGCGCTAATGGTACTCAAACTTATAAAGCTACTGTTAAj\GTTTACGGAA^ 

TGACTTGACTAATCTAGTTGCTACTAAAAATGTAGACATCAACATCAATGGATTAGTTGCTAAAGAAACAGTTCA^ 
50 AAj^GCCGTTGCAGACAACGTTAAAGACAGTATCGATGTTCCAGCAGCCTACCTAGAAAAAGCCAAGGGTGAAGGT 
CCATTCACAGCAGGTGTCAACCATGTGATTCCATACCAACTCTTCGCAGGTGATGGCATGTTGACTCGTCTCT^ 
TCAAGGCATCTGACAAGGCACCATGGTCAGATAACGCCGACGCTAAAAACCCAGCCCTATCTCCACTAGGCGAAA 
ACGTGAAGaCCAAAGGTCAATACTTCTATCAAGTAGCCTTGGACGGAAATGTAGCTGGCAAAGAAAAACAAGCGC 
TCATTGaCCAGTTCCGAGCAAATGGTACTCAAj^CTTACAGCGCTACAGTCAATGTCTATGGTAACAAAGACGGT^ 
55 AACCAGACTTGGACAACATCGTAGCAACrAAAAAAGTCACTATTAACATAAACGGTTTAATTTCTAAAG 

TTCAAAAAGCCGrrGCAGACAACGTTAAACACAGTATCGATGTTCCAGCAGCCTACCTAGAAAAAGCCAATC 
AAGCTCCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACTCTTCGCAGGTCATGGTATGTTGACTCCTCr 
CTTGCTC>AGGCATCTGACAAGGCACCATGGTCAGATAACGGTGACGCTAAAAACCCAGCCCTATCTCCACTAGG 
TGAAAACGTGAAGACCAAAGGTCAATACTTCrATCAATTAGCCTTGGACCGA^TGTAGCTGGCAAAGAAAAAC^ 
60 AGCGCTCATTGACCAGTTCCGAGCAAACGGTACTCAAACTTACAGCGCTACAGTCAATGTCTATGGTAACAAAGA 
CGGTAAACCAGACTTGGACAACATCGTAGCAACTAAAAAAGTCACTATTAACATAAACGGTTTAATT^ 
AACAGTTCAAAAAGCCGTTGCAGACAACGTTAAGACAGTATCGATGTTCCAGCAGCCTACCTAG 



4172.5 
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ATGAAACTAAAAACTTATATITTGGTTGGATATATTATTTCAA.CCCTCrrAACCATTTTGUl 1 1 1 I IGGGCTGT 
TCAAAj\AATGCTGATTGCGAAAGGCGAGATTTACTTTTTCCTTGGGATGAC^ 
GGGATTAGTCTCrrrCTCCTATTGCCAGTCTTTACGTCGTTGGGCAAACrCAAGGACC^ 
CCAAGGATTTTCCTTCAj\ATTTGGACGTTCAAGGTCCTGTAGAATTTCaCCAAT^ 

5 gtcccatgatttgcaggtaacctttgattccttggaagaaagcgaacgagaaaagggcttga 

CTCGCATGATATTAj^GACTCCTATCACTTCGATCCAACCGACGGTAGAAGGGATTTTGGATGCGATTATCAAGGA 
GTCGGAGCAAGCTCATTATCTAGCAACCATTGGACGCCAGACGGAGAGGCTCAATAAACTGGTrGAGGAGTTCAA 
1 nil I GACCCTAAACACAGCTAGAAATCAGGTGGAAACTACCAGTAAAGACAGTA I i 1 1 i ctggacaagctctta 
ATTGAGTGCATGAGTGAATTTCAGTTTTTGATTGAGCAGGAGAGAAGAGATGTCCACTTGCAGC^ 
10 CTGCCCGGATTGAGGGAGATrATGCTAAGCTTTCTCGTATCTTGGTGAj\TCTGGTCGATAACGC^^ 

TGCTCCAGGAACCAAGCTGGAACTGCTGGCTA AGCT GGAGAAGGACCAGCTTrCAATCAGTGTGACCGATGAAG 
G CACCGT ATTGCCCCAG AGG ATTTGG AAAAT ATTTTCAAACG CCTTTATCGTGTCG AAAC^ 
ACAGGTGGTCATGGATTAGGACTTGCGATTGCGCCTGAATTGGCCCATCAATTGGGTGGGGAAATCACAGTCAGC 
AGCCAGTACGGTCTAGGAAGTACCTTTACCCTCGTTCTCAACCrCTCTGGTAGTGAAAATAAAGCCTAA 

15 

4172.6 

AT G ' riU GGTCAAACGGCTCAACATGGTCTTACGAATAGCCTGAAAGACTTCTGGA l ri i 1 CTGCTGAATATAGGTC 
CACAATTGGCG l ' l 1 1 I i I GCCAGATGCTCCGCTGTTCCAGATCGGTTGAGCAGGGTACTGGAAATCACCGTCGTGA 
GTTCAATATGATTCAGCAGATATTCTCGCATTTTGGGATGACTCAGTTGGGACAAATCAAGTTCCTCTATC^ 
20 TCGATTGACCTTGAGTTGCTGGTCAATGCACTTAATCATCACnTGCTCATTGACAGACrGGTCCTCAC^ 

AAATAACGATAGAAATCGACAGGCAGATAGTACATGGTCrrGACCTGCTGAAGGGGCGTAAAGACAAAGAGATTA 

TCGaCATAAAAAGTATGTTC^lGGCAGTTAGAACTGGCTAGCACGCAACAAATCTGTCCGATAAATCAGCGAGTGC 

ATCATGGTATACTGGCCTTTGGAGAAATTTCCGACCTGGTCCCAGCCAAAAATCTGCCGAACAGGCAAGACTGA 

25 4174.1 

ATGGAACATTTAGCAACrrATTTTTCAj\CCTATGGAGGAGCTTTCTrCGCTGCAT^ 

GATTAAGCGGTATGGGCTCTGCTTATGGAGTTGGTAAGGCTGGGCAATCrGCCGCAGCTTTACTGAAAGAAC^ 
CTGAAAAGTTrGCCTCAGCTTTGATATTCCAATT^\TTGCCCGGAA 
TTTAATTTGGTTGCAATTAACTCCAGAACTTCCnTTAGAAAAAGGCGTTGC^ 
30 CTATTGTAGG AT AC 111 ICAGCTAAGCATCAAGG AAATGTAGCAGT AGCGGG AATGCAAATCTTGGCTAAAAG AC 

CAAAAGAATTCATGAAGGGAGCAATTTTAGCTGCCATGGTAGAAACCTATGCAATTCrrGCl 1 1 IGTCGTATCATT 
CATTTTGACCCTTCGTGTATAA 
4175.2 

ATGTTAAAATCAGAAAAACAATCACGTTATCAAATCTTAAATGAAGAATTGTCCTTCCTATTGGAAGGCG AAACC 
3 5 AATGTTTTGGCT AATCTTTCCAACGCCAGTG CTCTCAT AAAATCACGTTTTCCTAAT ACCGTATITGCAGGCTTTTA 

TTTGTTCGATGGAAAGGAATTGGl 1 1 1 AGGCCCCTTCCAAGGAGGTGTTTCCTGCATCCGTATTGCACTAGGCAAG 
GGTGTTTGTGGTGAGCCAGCTCACTTTCAGGAAACTGTTATTGTTGGAGATGTGACGACCTATCTCAACTATAT^ 
CTTGTGATAGTCTAGCTAAAAGTGAAATTGTGGTGCCGATGATGAAGAATGGTCAGTTACTTGGAGTTCTGGATCT 
GGATTCTTCAGACATTGAGGATTACGATGCTATGGATCGAGATTATrrGGAACAATTTGTCGCTATTTTG 
40 AAGACAGCATGGGACTTTACGATGTTTGAGGAAAAATCTTAA 

4175.3 

ATGTCAGTATTAGACATCAAAGATCTTCACGTTGAGATTGAAGGAAAAGAAATTTTAAAAGGGGTTAACCTGACC 
CTGAAAACAGGAGAAATTGCCGCTATCATGGGACCAAATGGTACAGGTAAATCGACTCTTTCTGCCGCTATCATG 

45 GGAAATCCAAACTATGAAGTAACTAAAGGTGAAGTTTTGTTTGATGGCGTAAACATCCITGAGT^ 

GAGCGTGCGCGTATGGGA CM i 1 CCTTGCTATGCAATACCCATCAGAAATCCCTGGAATTACCAATGCTGAGTTTC 
TTCGTGCCGCTATGAATGCGGGTAAAGAAGATGATGAGAAGATTTCAGTTCCTGAGTTTATTACTAAGCTAGATGA 
AAAAATGGAATTGCTCAACATGAAAGAAGAAATGGCAGAGCGTTACCTCAACGAAGGCTTCTCT 
AAAACGCAATGAAATTCTTCAA CI f 1 i GATGTTGGAGCCAACATTTGCT CI ' rn GGACGAGATTGACTCAGGTCTT 

50 GATATTGACGCTCrrAAAGTTGTGTCTAAAGGTGTCAATGCCATGCGTGGTGAAGGTTTTGGTGCTATGAT CA 

CTCACTACCAACGT Ll 1 i 1 GAACTATATCACACCTGATGTGGTACACGTGATGATGGAAGCTCGTGTTGTCCTTTC 
TGGTGGTCCAGAATTGGCTGCGCGTTTGGAACCTGAAGGATACGCAAAATTAGCTGAAGAACTTGGCTACGACTA 
CAAGGAAGAATTGTAA 

55 4174.4 

ATGCCCTACAAAAGACAAAGGA U i 1 m CAATGGCACTTTCTAAACTAGATAGCCTrrATATGCCAGTGGTAGCAG 
ACCATTCGAAAAATCCACATCACCAAGGGAAGTTAGAAGATGCTGAGCAAATCAGTCTCAACAATCCGACTTGTG 
GGGATGTCATCAACCTCTCTGTCAAGTTTGATGCAGAGGA CCGTT TGGAAGATATTG Cl ' IU ' l CTAAATTCAGGATG 
CACG ATTTCAACTG CTTCTG CT AGT ATG ATG AC A 
60 GCG ACTA i ■ I ■ M M CTG AAATGGTTCAAGGGC AAAAAG ATG AGCGTCAAG ACC AACTTGG AGACGCGGCA Tl C 1 TG 

TCAGGTGrrGCCAAATTCCCrCAAAGAATCAAGTGTGCAACCCTAGCTTGGAATGCCCTTAAGAAAACAATTGAA 
AATCAAGAAAAACAGTAA 

4175.5 



% % 
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ATGAAAATTCAAGACCTATTGAGAAAAGATGTCATGTTGCTAGATTTGCACGCAACTGAAAAAACAG CTCr^ 
GACGAGATGATTAAAAATTTGACAGACCACGGTTATGTAACAGATTTTGAAACATTTAAAGAAGGAAr^ 
CGTGAAGCTTTGACTTCTACTGGTTTGGGTGATGGAATCGCAj^TGCCrCACAGCAAAAACG 
GCGACACTTCTATTTGCrAAGTCAAATAAGGGTGTTGACrACGAGAGCTTGGATGGACAAGCAACTGACCTC^ 
5 TCATCATTGCAGCTCCAGAAGCTGCCAATGATACTCACTTGGCAGCCTTGGCAGAATTGTCTC^ 

AGACG Ol ri I GCAGACAAACTTCGTCAAGCAACATCTGCAGACCAAGTTATCGAA Cl 1 1 1 I GACCAAGCTTCAGAA 
AAAACTGAGGAA C I T G I T CAAGCACCTGCTAATGACTCTGGTGACTTTATCGTAGCTGTTACAGCTTGTACAACAG 
GTATTGCCCACACTTACATCGCCCAAGAAGCCCTTCAAAAAGTAGCTGCTGAAATGGGGGrrGCT 
AAACCAACGGTGCTAGCGGTGTTGGAAATCAACTAACrGCAGAAGATATCCCTAAGGCTAAAGCTATTATC^ 

10 CAGCAGACAAGGCCGTTGAAATGGATCGATTTGATGGAAJVJVCCATTGATCAATCGTCCAGTTGCTGAC^ 
CTAAGACAGAAGAGCTAATTAACrrGGCTCTTTCAGGAGATACTGAAGTCTACCGTGCCGCT 
CTGCAACAGCCTCTAACGAAAJ>u\CAAAGCCTTGGTGGTGCCTrCTACAAACACTrGATGAG^ 
TGTTACCATTCGTTATCGGTGGTGGTATCATGATTGCCCTTGCLl 1 L 1 1 GATTGACGGTGCTTTGGGT GTTCCAAAT 
GAAAACCTTGGCAATGTTGGTTCTTACCATGAGTTAGCrrCTATGTTCATGAAAAT^ 

15 GATGCTrCCAGTCTTTGCGGGTT\\TGTTGCCTACTCT 

GGTGCTATTGCCAAAGAAGGTTTTGCCTTTGGTAAAATTCCTTATGCCGCAGGTGGTGAAGC^ 
CAGGTGTCTCATCTGGTTTCCTAGGTGCCC I 'lOnGGTGGATrTATCGCAGGTGCCTTG Gi ICl I GCCATCAAG AAA 
TACGTTAAAGTTCCTCGTTCACTCGAAGGTGCTAAATCAATCCTTCTATTGCCACTTC^ 

GA l ' l IGI TATGCTAGCTGTGAATATCCCAATGGCTGCAATCAACACTGCTATGAATGACTTCCTAGGCGGTCTTGG 
20 AGGAGGTTCAGCTGTCCl-1 CI 1 GGTATCGTCCTTGGTCGAATGATGGCTGTTGACATGGCTGGACCAGTTAATAAA 

GCAGCTTATGTCTTTGGTACAGGTACGCTTG CAGCA ACTO TI i LTl CAGGTGGTTCTGTAGCCATGGCAGCAGTTA 
TGGCTGGAGGAATGCTGCCACCACTTGCAATCTTTGTCGCAACTCTTCTTTTCAAAG 
ACGTAACTCTGGTTTGACAAACATCATCATGGGCTTGTCATTTATCACTGAGGGAGCGATTCCATTTG 
GACCCAGCTCGTGCGATTCCAAGCTTCATCCTTGGTTCAGCAGTAGCAGGTCGACTCGTTGGTCrrACTGGT^ 
25 AACTCATGGCGCCACACGGAGGAATCTTCGTTATCGCCCITACTTCAAATGCTCTCC^ 
GTAGGAGCAATCGTAAGTGGTGTGGTTTATGGTTACCTACGCAAACCACAAGCATAA 

4175.6 

ATGGCAAJ^CAAGAATACAAGTACAACAAGACGGAGACCGTCTAAAGCAGAACTGGAAAGAAAAGAAGCGATTCA 

30 ACGAATGTTGATTTCCTTAGGAATTGCGATTTTATTGATTTTCGCAGCCrrC/^ 

CTTTATAATTTAATTCGCTTGCTAGTGGGTAGCCTAGCTT ATCTGG CGATATTCGGCCT 
TTTCAAGTGGATACGAAAACAGGAAGGACTCTTATCTGGCTTTTTCACCATATTTGCT 
AGGCCTACrrGGTrrGGAAATATGGTTTGGACAAGTCCGTTCTAAAAGGGACCATGGCTCAGGTTCT 
GACTGGTTTTCGAACGACTAGCTTTGCTGGAGGGGGCTTGATCGGGGTCGCTCrTrATATTCC/^ 

35 TTTTCAAATATCGGAACTTACrrrATTGGTrCTATCTTGATTTTAGTG 

TACGATATTGCTGAA l 11 ri CAGTAGAGGCTTTGCCAAATGGTGGGAAGGGCACGAGCGTCGAAAAGAGGAACGC 
TTTGTCAAACAAGAAGAAAAAGCTCGCCAAAAGGCTGAGAAAGAGGCTAGATTAGAACAAGAAGAGACTGAj^^ 
AGCCTTACTCGATTTGCCTCCrCTTGATATGGAAACGGGTGAAATTCTGACAGAGGAAGCTGTTCAA^ 
CCTATTCCAGAAGAAAAGTGGGTGGAACCAG AAATCA TCCTGCCTCAAGCTGAACTTAAATTCCCTGAACAGG^ 

40 GATGACTCAGATGACGAAGATGTTCAGGTCGATTTrrCAGCCAj\AGAAGCCCTTGAATACAAACTTC 

CAACTCTTTGCACCAGATAAACCAAAAGATCAGTCTAAAGAGAAGAAAATTGTCAGAGAAAATATCAAAATC^ 
GAAGCAACCTTTGCTAGCTTTGGTATTAAGGTAACACTTGAj^CGGGCCCAAATTGGGCCATCA 
GAAGTCAAGCCGGCTGTTCGTGTAAGGGTCAACCGCATTTCCAATCTATCAGaTGACCTCGCTCTAGCCTTGGCTG 
CCAAAGATGTCCGGATTGAAGCACCAATCCCTGGGAAATCCCTAATCGGAATTGAAGTGCCCAACT CCGA TATTG 

45 CCACTGTATCTTTCCGAGAACTATGGGaACAATCGCAAACGAAAGCAGAAAA I'll l n GGAAATTCCTTTA GGGA 

AGGCTGTTAATGGAACCGCAAGAGCrrri'GACL'l 11 CTAAAATGCCCCACTTGCTAGTTGCAGGTTCAACGGGTTC 
AGGGAAGTCAGTAGCAGTTAACGGCATTATTGCTAGCATTCrCATGAAGGCGAGACCAGATCAAGTTAAATTTAT 
GATGGTCGATCCCAAGATGGTTGAGTTATCTGTTTACAATGATATTCCCCACCTCTTGATrCCAGTCGTGACCAAT 
CCACGCAAAGCCAGCAAGGCTCTGCAAAAGGTTGTGGATGAAATGGAAAACCGTTATGAACTCTTTGCCi^ 

50 GGAGTTCGGAATATTGCAG G I riU AATGCCAAGGTAGAAGAGTTCAATTCCCAGTCTGAGTACAAGCAAA 

CTACCATTCATTGTCGTGATTGTGGATGAGTTGGCTGACCTCATGATGGTGGCCAGCAAGGAAGTGGAAGATGCTA 
TCATCCGTCTTGGGCAGAAGGCGCGTGCTGCAGGTATCCACATGATTCTTGCAACTCAGCCTCCATCTGTTGATGT 
C ATCT CTGGTTrGATTAAGGCCAATGTTCCATCTCGTGTAGCATTTGCGGTTTCATCAGGAACAGACTCCCGTA 
ATTTTGGATGAAAATGGAGCAGAAAAAC rr C 1 1 GGTCGAGGAGACATGCTCTTTAAACCGATTGATGAAAATCAT 

55 CCAGTTCCTCTCCAAGGCrCCTTTATCTCGGATGACGATGTTGAGCGCATTGTGAACrrCATCAAGACTCAGGCAG 
ATGCAG ACT AC G ATG AG AG 111" I'G ATCCAGGTG AGGTTTCTG AAAATG AAGG AG AATTTTCGG ATGG AG ATGCTG 

gtggtgatccgcri'irigaagaagctaagtcrrtggttatcgaaacacagaaagccagtccgtctatgattcagcg 
tcgtttatcagttggatttaaccgtgcgacccgtctcatggaagaactggagatagcaggtctcatcggtccagct 

G AAGGT ACCAAA CCTCG A AAAGTGTT AC AAC AAT AA 

60 

4176.1 

ATGAGTTATTTTAAAAAATA TAAA TrCGATAAATCCCACTTCAAACTTGGTATGCGAACCTT^ 
CTGrri 1 ICTAGTT CTCT TGA l 11 IT GG Ci I ' ri ' ri 'GGCTGGAAAGGTCTTCAAATTGGTGCTTTGACAGCC G rri ' I - l A 
GCCTGAGGGAGAGTTTTGATGAGAGTGTTCATTTTGGGACTTCGCGTATTCTAGGAAATACTATCGGTGGACTCTA 
65 TGCCTTGGTCnCnCri'ATTAAATACCl M il CCACGAAG CC IT riGGGTGACCTTGGTAGTTGTTCCAATCTGCA 



% % 
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CCATGTTAACCATTATGACAAATOTAGCCATGAATAACAAAGCAGGGGTTAT^^ 

CATTACCCTATCAATTCCAACTGGTGAGACAATriTGTACGTGTTTGTGCCTGTATTAGAAACC^ 

TTTGTCGCAATTATCGTAAATTACCATATTGATCGTATTCCTCTCTTTTTAGAGA^ 

5 4178.2 

ATGAATAAATCAGAACACCGCCACCAACTTATACGCCCTCTTATCACAAAAAACAAGATTC^ 
TTGCAAGCCCTTCTTGCrGAGAACGACATTCAACTAACCCAGGCAACCCTCTCACGCGACATCAAA>^ 
CTATCAAAAGTCCGCGAAGAAGATAGCGCTTATTATCTTCTTAACAATGGTTCCATCTC^^ 
TCGAACTCTACATGGAAGACGCCCTTGTCTGGATGCGCCCAGTTCAACACCAACT^ 
lU ACTCGCTCAATCCrrrGCTTCrATCATTGATACITTGAGCrTCCCT^ 

TCT GTCT TATCATCTGTGAAGATGCAGATACTGCTCAAAAGTGCTTTGAAGAACTGAAAAAATTCGCCCC^ 
TTTCTTTGAAGAATAA 

4179.1 

1 5 ATGAAAAGT AT AAAATT AAATGCTCTATCTTACATGGG AATTCGTGTCTTG AATATTATTT^ 

AACC TATG TCGCGCGTGTCTTGGACCGAACTGACTATGGTTACTTCAACTCACTCGACACTATTTTCT 
TGCCCTTTGCAA^TTATGGTGTCTAT 

TAACAGAAC^ ^ l i i i CTAGTLl i 1 L 1 1 ATTTGTGCATCGCnTGTACGATTTTGACCACTGCTGTCTATATCCTAGCCT 
ATCCT CTLi I CI 1 lACTGATAATCCAATCCTCAAAAAGCTCTACCTtGTTATGGGGATTCAACTCATTGCCCAGA^ 
20 TTTTCAATCGAATG GGTC AATGAAGCTCTGGAAAATTACAGTTTTCrarm 

GATGCTGGTCTCTAf 1 1 I Cri ATTr CTTAAA AATGAACACGATATTGTTCTCTATACACTTGTGATGAGTTTATCXi 
CGCTGATTA ACTA CCTGATTAGTTATTTTTGGATTA AAAG AGACATCAAACTTGTTAAAA^ 

TAAACCACTci i i CTCCCTCTGACAGCCATCTTAGTCTTTGCCAATGCCaATATGCTCTTCA C T U 11 l ACATCGCC 
TCTrCCTCGTTAAAACAGGGATTGATGTCAACGTTAGTTACTATACCATAGCTCAGCGAATTCTGACCGT^ 
25 TGGCGTTGTAACAGGTGCAATTGGAGTGAGTGTGCCTCGTCTCAGTTACTATCTGGGGAAAGGAGACAAAGAAGC 
CTATGTrrCTCTGGTTAATA GACCT AGTCGAATCTTTAA Cl ' iCl 1 I ATCATTCCACTGAGTTTTGGACTCATGGTTT 
TAGGACCAAATGCCATCCTACriTACGGTAGTGAAAAATATATCGGAGGCGGCATCTTGACCTCTCTCrrCGC^ 
TCGTACGATTATCCTGGCCTTAGATACCATTCTTGGTTCCCAAATTCTCT^ 

ACAGTCTATACAGTCrrTGCTGGGCTACTCAATTTGGGCTTGAATAGTCTC Cl 1 11 ITI CAACCATATCGTGGCTCC 
30 TGAATACTACTTACTGACAACTATGCTATCAGAGACnrrCTCTAC^ 

AACTCATCCACTTGGGACATATCTTrAGCTATACTGTTCGATACTCTCTCTTTT^ 

TCCTGATTAATTTCGTGTATCCTGTAGATATGGTCATTAATTTGCCATTTTTGATTA 

ATACTGGTTTGATTGTCTTGCTATCAGCTATCTCT^ 
^ ^ G AATTTTT AAACCATGTCCTAGCCTTAAAAAATAAATTT AAAAAATC^ 

4179.2 

ATGAAACAAGTAACCGTTGAAGATGCCAAACAAATrGAATTAGAAATTTTGGATTATATTGATACTCT 
AGCACAATATCAACTATATTATTAACTACGGTACTCTGATTGGGGCGGTTCGACATGAGGGCTTTATCC 
. * CGACG ATATTG ATCTGTCCATGCCTAG AGAAGACTACCAACGATTTATTAACATTTTTCAAAAGG AAAAAAG 
4U GTATAAGCTCCTATCCTTAGAAACTGATAAGAACTACITTAACAACTTTATCAAGATAACC^ 

ATTATTGATACTCGAAATACAAAAACCTATGAGTCTGGTATCTTTATCGATATTTTCCCTATAGATCGCm 
TCCTAAGGTCATTGAT^CTTGTTATAAACTGGAAAGCTTCAAACTGCTGT 

TATAAGGATAGCCTTTTAAAAGATTGGATACGAACACCCTTCTGGTTACTCCrrCGACCGGT^ 
TGCAAATAAAATCGAGAAAGAAATTCAAAAATATAGTCGTGAAAATGGGCAATATATGGCTTTTATCCCT^ 
4D ATTTAAGGAAAAGGAAGTCTrCCC AAGTG GTACCTTrGATAAAACAATCGATTTACCCTTTGAGAATT^ 

CCTGCACCTGAAAAATTTGATACTATTTrGACACAATTTTATGGAGATTATATGACCCTACCACCAGAAGAAAAAC 
GCTTCTACAGTCATGAATTTCACGCTTATAAATTGGAGGATTAG 

4179.3 

50 ATGATAAAAATCAATCATCTAACCATCACACAAAACAAAGATrrACGAGATCrrGTATCTGACCTAACCATG 

ATCCAAG ACGG GGAAAAGGTTGCTATTATTGGTGAAGAAGGAAATGGCAAATCAACCTTACTTAAAATTTTAATG 
GGGGAAGCTTTGTCTGATTTCACTATCAAGGGAAACATCCAATCTGACTATCAGTCACTGGCCTACATTC^ 
AAGTCCCTGi^ACCTAAAAAAGAAAACTTTACACGACTA Ci ICl 1 1 11 AGATTCTATTG ATTTAGACTACAGTAT 
CCTCTATCGTITCGCGGAGGAATTGCATTTTGAT^ 

DD GGGGGCGAAGCITTGAAAATrCAGCTTATCCATGAGTTAGCCAAACCCTTTGAGAT^ 

CAAATGACCTAGACC TTGAGACAG TTGATrGGCTAAAAGGCCAGATTCAAAAGACCACGCAAACCGTTATTT^ 
TTTCCCATGATGAAGAv-i i lun i CTGAAACGGCAGACACTATTGTTCACTTGCCACTGGTCAAACACCGTAAAGA 
AGCGGAAACGCTAGTAGAGCATTTAGACTATGATAGCTATAGTGAGCAGAGAAACGCTAATTTTGCCAAACAAAG 
TCAGCAAGCTGCTAACAACCAAAGAGCCTACGATAAAACCATGGAAAAACATCGGAGAGTTAACCAAAATGTAG 

OU AAACTGCGCTTCCAGCTACCAAAGATAGTACTGCCGGTCGCCTATTGGCTAAAAAGATGAAAACTGTCCT 

AAGAAAAACGCTACGAAAAGGCAGCTCAGTCCATGACTCAAAAGCCACTTGAAGAGGAACAAATCCAACTT^ 
TTTCA^ACATCCAACCATTACCAGCrrCTAAAGTCTTAGTCCAACT 

AGTTTTGGTTCAAAAACTACAACTAACTGTCCGTGGCCAAGAAAAAATCGGTATTATCGG 
^C^JJAATGGTGTTCGGAAATCAACTCTGTTAGCCAAGTTACAGAGAC^ 
OD TCGi I I iATGCCACAAGATTACCACAAAAAACTGCAATTGGATTTATCCCCAATAGCCTATCTCACTAAAACTGGG 
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GAAAAAGAGGAACTACAGAAAATCCAATCTCACCTAGCTAGTCTCAATTTCAGTTATCCAGAAATGC AGCA 
ATTCGCTCCTTATCTGGCGGACAACACGCAAAACrCCTGLi I I 1 GGATTTACTCCTGCGCAAACCAAACTTTCTCC 
TGCTGGATGAACCCACACGAAA C1 iri CTCCCACTTCTCAACCCCAAATCAGAAAACTCTTTG 

cggtctcatcactgtttcgcatgaccgtcg rn c 1 r aaaagaagtctgctcgatcatctatcgcatgacagaacac 
5 ggtttgaagctacttaatttagaagatttataa 

4179.4 

atgaaaccaaaaacattttacaacttgctt 

gaacgttattttgagctcttggtcgagtggaatgagaagattaatttgacggcgattacggacaaggaaga^ 

10 TATCTCAAACAi ri I lACGATTCGATTGO\CCCArrCTrCAAGGTTTGATrCCCAATGAAACT iGA 

TATCGGGGCTGGGGCAGGATTTCCTAGTCTACCAATGAAAATTCTCTATCCGGACTTAGATGTGACCATTATTGAT 
TCACTCAATAAGCGCATCAACITCCTACAACTCTTGGCTCAAGAACT 

GACGTGCCGAAGATTTTGCCCAAGACAAGAACTTCCGTGCTCAATATGATTTTGTAACAGCTCCTGCGG^ 
TATGCAGGTCCTATCTGAATTGACTATrCCCTACCTTAAGGTTGGTGGCAAACTATTAGCACTCAAGGCTAGC^ 
15 GCGCCTG ACG AATTATTAG AAGCTAAG A ATGCCCTC AATCTCf ITl " 1 " I'AGT AAGGTCG AAGACAATCTCAGciACG 

CCCTACCGAATAGAGATCCGCGCTAT ATCA CAGTGGTAGAAAAGAAAAAAGAAACACCAAATAAATATCCACGTA 
AGGCTGGTATGCCAAATAAACGCCCACTTTAA 

4179.6 

20 atgagtattaaactaattgccgrrgatatcgacggaacccttgtcaacagccaaaaggaaatcac^ 

tttctgccatccaagatgccaaacaagctggtgtcaaagtcgtgattgcaactggccgccctatcgcaggcgttgc 
ca^cttctagacgacttgcagttgagagacgagggggactatgtggtaaccttcaacggtgcccttgtccaaga 
aactgctacaggacatgagattatcagcgaatccttgacttatgacgattatctagatatggaattcctcagtcgc 
aagctcggtgtccacatgcatgccattaccaaggacggtatctatactgcaaatcgcaatatcggaaaatacact 

25 gtacacgaatcaaccctcgtcaccatgcctatcttctaccgtacccctgaagaaatggctggcaaagaaattgtta 
aatgta'tgtttatcgatgaa ccag aaattctcgatgctgcgattgaaaaaattccagcagaattttacgagcg 
ctccatcaacaaatctcctcctttctacctcgaactccttaaaaagaatgtagacaagggttc^ 
ttggctgaaaaactcggattgaccaaagatgaaaccatggcaatcggtgatgaagaaaatgaccgtgccatgctg 
gaagtcgttggaaaccccgttgtcatggaaaatggaaatccagaaatcaaaaaaatccccaaatacatcaccaaa 

30 acaaatgacgaatccggcgttgcccatgccatccgaacatgcgtactgtaa 

4179.7 

ATGACTTGGATTATTCTTGGAGTTATCGCTCTTATTGTTAl I I 1 1'GTGATTGTTAGCTATAACGGTTTGGTTAAAAA 
TCGTATGCAAACCAAGGAGGCTTGGAGTCAGATTGATGTTCAGTTGAAACGTCCCAATGACCTCITGCCAAACTTG 

35 ATTGAGACTGTAAAAGGTTATGCCAAATATGAAGGTTCTACCCTTGAAAAGGTGGCAGAACTACGTAACCAAGTG 
GCGGCAGCGACrTCACCAGCAGAAGCTATGAAAGCCAGTGATGCCCrCACTCGTCAGGTTTCAGGTA l 11 i IG CAG 
TTGCAGAAAGCTATCCAGATTTGAAAGCTAGTGCTAACmGTTAAATTGCAAGAGGAGTTGACA AACAC AGAi^ 
ATA AAA n I L 1 ' 1 ACTCTCG TC AA CTCTATAACAGTGTTGTCAGC AACTACAATGT AAAATTAG AAACTTTCCCG AG 
CAATATTATCGCTGGAATGTTTGGATTTAAAGCGGCAGATTTCCTTCAAACACCTGAAGAGGAAAAGTCGGT^ 

40 AAAGTTGATTTTAGCGGTTTAGGTGACTAA 

4179.8 

ATGTTCTTTCATCAAATTGCAAGCAATAAACGAAAAACCTGGATTTTGTTGCTGGTATTT^ 
TGTTGG TTAT GCGGTTG GTTATC TCTTTATAAGATCTGGACTTGGTGGTTTGGTTATTGCACTGATTATCGGC^^ 

45 TCTACGCTTTCTCTATGATTTTTCAATCGACAGAGATrGTCATGTCCATGAATGGAGCGCGTGAGGTGGATC 

AACGGCACCAGACCTCTACCATGTAGTGGAAGATATGGCTCTGGTCGCTCAGATTCCTATGCCCCGTGTTTTCATC 
ATTGATGATCCAGCCTTAAATGCCTTTGCGACAGGTTCTAATCCTCAAAATGCGGCTGTTGCTGCGAC^ 
TACTAGCTATCATGAATCGTGAAGAACTAGAAGCTGTTATGGGACATGAAGTCAGTCATATTCGTAATTATGATAT 
CCGTATTTCGACTATTGCAGTTGCCCTTGCTAGTGCTATCACCATGCTTTCTACTATGGCAGGTCGTATGATCT 

50 GGGGTGGAGCAGGTCGCAGACGAAGTGATGATGACCGAGATGGAAATGCTCTTGAAATCATTATGCTAGTGGTTT 
CCCTACTAGCr ATTGTACTGGCACCTCTCGCTGCAACCrrCGTTCAGCTCGCTATTTCTCGTC^^ 
GCAGATGCATCTACTGTCGAGCTGACTCGCAATCCCCAGGGAATGATTAATGCCCTAGATAAGTTGGACAATAGC 
AAACCTAT GAGTC GCCACGTCGATGATGCTAGCAGTGCCCTTTATATCAATGATCCTAAGAAAGGTGGGGCGTTC 
CAAAAACTCTTTTATACCCACCCACCTATCTCAGAACGGATTGAACGTTTAAAACAGATGTA^ 

4179.9 

ATGAAATrAAATATTCAAGAAATTCGTAAGCAGTCTGAAGGTn'GAACTTrGAACAAACGTTAGArrT^ 
ACCTGCGTGCACGTAATCAAGAAATTTTAGATGTAAAAGATATCCTTGCAGTTGGGAAAGTACAATATGAAGACC 
GTATCTATTTCXTAGATTATCAACTATCTTATACCATT^ 
60 GTTGAATCTTATCCAGTCACGGAAGTTTTCATGGAAGGCGCAACTAACCAGCTAGATCAAGAAGTTTTAGA 

ACTTGGTCTTGCCCATCGAAAATGCGGAGCTTGACCTTGCTGAGAGTGTATCAGACAATATCCTGCTAAACA^ 
TATCAAGGTCTTGACCGCTGAAGAAGAACCTGGTCAAGGATTTATCTCAGGAAATGACTGGCAAATCATGACAGA 
GGAAGAATACCAAGCTCAAAAAGCAGTAAAGAAAGAAGAAAACAGTCCTTTTGCTGGCTTACAAGGACTAT^ 
CGGAGATGAATAA 

65 
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4179.12 



ATGGACTTATTTATGAAAATCACAAACTATGAAATCTA TA^^GT TAAAAAAATCAGCTTTGACCAA 
TGAAAGTGCTAGAATACGGTGAAAATCTTGATCAGCAGCl i i i GTTGCCTGATATTGCAGATATCTCAGGTTGCCG 
TAATCCAGCCCii I'l ri ATGGAACGTTATTTTCAGATAGACGATGCGCATTTCTCGAAAGAGT TTCAA A AATT TCC^ 
TCTTTCTCTATTTTAGATGACTGTTATCCTTGGGATTTGAGTGAAATATATGATGCGCCT I IATTTTACAA 

GGGAAATCTTGACCTCCTGAAATTCCCGAAGGTAGCGGTCGTGGGCAGTCGTGCTTGTAGCAAACAGGGAGCTAA 
GTCACTTGAAAAAGTCATTCAAGGCTTCGAAAATGAACTGGTTATTGTCAGTGGTCTGGCCAAGGG^ 
AGCAGCTCATATGGCAGCTCTTCAGAATGGCGGAAAAACCATTGCAGTGATTGGAACAGGACTGGATGTGTTT^ 
TCCTAAAGCCAATAAACGCnTGCAAGACTACATCGGCAATGACCATCTGGTTCTAACTGA^TATGGACCTGGTGA 
ACAACCTCTGAAATTTO^TTTTCCTGCCCGTAATCGCATCATTGCTGGACnTTGTCGTGGT 

GCTAAGATGCGTTCAGGTAGTCTCATTACCTGTGAGCGAGCAATGGAAGAAGGACGCGATGTCTTTGCTATTCCTG 
GTAGCATmAGATGGACTATCA GACG GTTGCCATCATTTGATTCAAGAAGGAGCAAAATTGGTCACCAGTGGGC 
AAG ATGTTCTTG CGG AATTTG AATTTT AA 



ATG AAACGTCAATTAGCCrrGGTCGTCTTTAGTCGTGGTCAAG ATTCAACAACCTGCC 1 ' 1 ' i ' 1 CTGGGTCATGCAAC 

ACTATGAAACAGTCGAAGCTGTCACCTTTGCCTACGGCCAACGTCATCACCTCGAAATTCAAATTACTAGAGAAAT 

CGCTAAGGAACACGGCATTCGTCACCATATCCTCGATATGTCTCTCCTGGGACAAATCACTGCTCAGCCAGACTT^ 

GCGACGATTCATATTTCCTACATTCCTGACAAGCTCTGTGTCGAGTCAAAATCCCTCAAACTATATCTArr^ 

ACCGAAACCACGGAGATTTCCACGAAAACTGTATCAACACCATCGOGAAAGACTTGGTCAACTTGCT^GAC^ 

GCTATTTAGAACTCTGGGGAAAATTCACTCCGCGCCGTGGCATTTCAATCGACCCCTACTACAACTACCC 

AGGAACTAAGTATGAGGGCTTGGCAGAACAACGCCTCTTCCAj^CACGaCCI r i ATCCAGAGAAAATTGACAACCG 

CTAA 



ATGACCGAAACGGTAGAAGATAAj\GTAAGTCATTCAATTACTGGGCTTGATATCCTCAAGGCGATAGTTGCTGCG 

GGAGCTGTCATAAGTGGAACCGTTGCAACTCAAACGAAGGTATTTACAAATGAGTCAGCAGTACTTGAAAAAACT 

GTAGAGAAAACGGATGCTTTGGCAACAAATGATACAGTAGTTCrAGGTACGATATCTACAAGTAATTCAGCGACT 

TCAACTAGrrrGTCAGCTTCAGAGTCGGCAAGTACATCTGCATCTGAGTCAGCCTCAACCAGCGCTTCGACCT 

CAj\GTACAAGTGCATCAGAATCAGCAAGTACATCGGCTTCGACAAGTATTTCTGCATCATCrACTGTGGTAGC^ 

ACAAACAGCTGCCGCTACACAAGCAACTGCTAAGAAGGTCGAAGAAGATCGTAAGAAACCAGCTAGTGATTATGT 

AGCATCAGTTACAAATGTCAATCTCCAATCTTATGCTAAGCGACGCAAGCCTTCAGTGGATTCCATCGAGCAATTG 

CTGGCTTCTATAAAAAATGCTGCTG I IT ITl CTGGCAATACGATTGTAAATGGCGCCCCTGCAATTAATGCAAGTC 

TAAj^CATTGCTAAAj^GTGAGACAAAAGTTTaTaCAGGTGAAGGTGTAGaTTCGGTATATCGTGTTCCAATI^ 

TAAATTGAAAGTGACAAATGATGGTTCAAAATTGACCrrrACCTATACGGTTACGTATGTGAATCCTA^ 

GATCrrGCTAATATATCAAGTATGCGTCCTGGATATTCTATCTATAATTCACGTACTTCAACACAAACAATG^ 

CCCTrGGCAGTGATCTTGGTAAACCTTCAGGTGTAAAGAACTACATTACTGACAAAAATGGTAGACAGGTT CT 

CTATAATACATCTACAATGACGACGCAGCGTAGTGGGTATACTTGGCGAAATGGTGCCCAAATGAATGGTTTCTTT 

GCTAAGAAAGGATATGGATTAACATCATCTTGGACTGTACCAATTACTGGAACGGA 

tacatcctttacatttaccccttacgctgct agaa cagatagaattggaattaactacttcaatggtggag 

gtagttgaatctagcacgaccagtcagtcactttcacagtctaagtcactctcagtaagtgctagtcaaagcgcct 

cagcttcagcatcaacaagtgcgtcggcttcagcatcaaccagtgcctcggcttcagcgtcaaccagtgcgtcag 

cttcagcaagtaccagtgcrrcagtctcagcatcaacaagtgcttcagcctcagcatcgacaagtgcctcggcttc 

agcaagcacatcagcatctgaatcagcgtcaaccagtgcttcggcttcagcaagtaccagtgcttcagcttcagc 

atcaaccagcgcctcggcctcagcaagcacctcagcrtctgaatccgcctcaaccagcgcctcggcctcagcaag 

cacctcagcrrctgaatcggcctcaaccagcgcctcagcctcagcatcaacgagtccttcggcttcagcaagcac 

aagcgcctcgggttcagcatcaacgagtacgtcagcttcagcgtcaaccagtgcttcagcctcagcatcaacaag 

tgcgtcagcctcagcaagtatctcagcgtctgaatcggcatcaacgagtgcgtctgagtcagcatcaacgagtac 

gtcagcctcagcaagcacctcagcttctgaatcggcctcaaccagtgcgtcagcctcagcatcgacaagcgcctc 

agcttcagcaagtaccagtgcttcagcctcagcgtcgacaagtgcgtcggcctcaaccagtgcatctgaatcggc 

atcaaccagtgcgtcagcctcagcaagtactagtgcatcggcttcagcatcaaccagtgccrcggcttcagcgtca 

accagtgcgtcagcttcagcaactaccagtgcttcagtctcagcatcaacaagtgcttcagcctcagcatcgaca 

agtgcctcggcttcagcaagcacatcagcatctgaatcagcgtcgacaagcgcctcagcttcagcaagtaccagt 

GCGTCAGCTTCAGCATCAACCAGCGCCTCGGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGCGCC 

tcggcctcagcaagcacctcagcttctgaatcggcctcaaccagcgcctcagcctcagcatcaacgagtgcttcg 
gcrtcagcaagcacaagcgcctcgggttcagcatcaacgagtacgtcagcttcagcgtcaaccagtgcttcagcc 
tcagcatcaacaagtgcgtcaccctcagcaagtatctcagcgtctgaatcggcatcaacgagtgcgtctgagtca 
gcatcaacgagtacgtcagcctcagcaagcacctcagcttctgaatcggcctcaaccagtgcgtcagcctcagca 
tcgacaagcgcctcagcttcagcaagtaccagtgcttcagcctcagctcgacaagtgcgtcggcctcaaccagtg 
catctgaatcggcatcaaccagtgcgtcagcctcagcaagtactagtgcatcagcttcagcatcaacgagtgcat 
cggcttcagcatcaaccagtgcctcggcttcagcgtcaaccagtgcgtcagcttcaccaagtaccagtgcttcagt 

CTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACAAGTGCcTCGGCTTCAGCAAGCACATCAGCATCTGAATCA 

GCGTCGACAAGCGCcTCAGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCGTCGACAAGTGCGTCAGCCTCAGCA 

AGTACTAGTGCATCAGCTTCAGCATCAACGAGTGCATCGGCTTCGGCGTCAACCAGTGCATCAGAGTCAGCAAGT 

accagtgcgtcagcttccgcatcaacaagtgcctcggcttcagcaagcaccagtccgtcggcttcagcaagtact 



4181.1 



4181.2 
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AGCGCCTCAGCCTCAGCCTCAACCAGTGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGACT 
GCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTC.Aj^CAAGTGCATCGGCrrCAGCGTCAACGACT^ 
TCTGAATCGGCATCAACGAGTGCGTCCGCTTCACCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTGCATCG 
GCTTCAGCATCAACGAGTGCGTCCCCTTCAGCAAGTACTAGCGCCTCAGCCrCAGCGTCAACAAGTGC^ 
5 CAGCGTCAACGAGTGCGTCTGAGTCAGCATCAACGAGTGCGTCAGCCTCAGCAAGCACATCAGCTTCTGAATCTG 
CATCAACCAGTGCGTCAGCCTCAGCATCGACAAGCGCCTCAGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCGT 
CGACAAGTGCGTCGGCTTCAGCAACTACCAGTGCGTCAGCCTCAGCAAGTACCAGTGCGTCAGCCTCAGCGTCGA 
CAAGTGCGTCGGCCTCAACCAGTGCATCTGAATCGGCATCAACCACTGCGTCAGCCTCAGCAAGTACTAGTGCAT 
CAGCTTCAGCATCAACGACTGCATCGGCTTCAGCATCAACCAGTGCATCAGACTCAGCAAGTACCAGTGCGTCAG 
-10 TTCCCCATCAACAAGTGCCTCGGCTTCAGCAAGTACTAG 

4183.1 

ATGGGGGTCGAAACTTGGTTTTATTCTAGCATCTGCTGGCTGCCCATCGGGCTTGGTTCCGTTTGGAA 
ACATGACTGCTCCTAATGGCGGTGGAGG L Ml I i ACTAATCTTrCTCATTTCCACTATTTTAATCGGTTTCCCTCTC 

15 CTGCTGGCTGAGTTTGCCCTTGGCCGTAGTGCTGGCGTTTCCGCTATCAAAACCTTTGGAAAACTGGCC^ 

ACAAGTACAACTTTATCGGTTGGATTGGCGCCTTTGCCCTCTTTATCCTCTTAT Cl 1 1 1 1 ACaGTGTTATCGG AGG A 
TGGATTCTAGTCTATCTAGGTATTGAGTTTGGGAAATrGTTCCAACTTCGTGGAACGCGTGATTATGCTCAGTTAT 
TTACTTCAATCATTTCAAATCCAGCCATTCCCCTAGGAGCTCAAGCGGCCTTTATCCTATTGAATATCTT CA^ 
TCACGTGGCGTTCAAAAAGGGATTGAAAGAGCTTCGAAAGTCATGATGCCCCTGCTCTTTATCGTCl i ibl 1 1 I lA 

20 TCATCGGTCGCTCrCTCAGTTTGCCAAAT<K:CATGGAAGGGGTTCrrrACTTCCT 

G ACTAGCACTGGTCTCCTCTATGCTCTGGGACAAT C 1 ' I T C 11 i GCCCTCTCACTAGGGGTTACACTCATGTTGACCT 
ATGCri'Cl lACTTAGACAAGAAAACCAATCTAGTCCAGTCAGGAATCTCCATCGTAGCCATGAATATCTCGATATC 
CATCATGGCAGGTCTAGCCATTTTCCAAGCTCGATCCCCCTTCAATATCCAGTCTGAAGGGGGACCCAGCCTGCTC 
TrrATCGTCTTCCCTCAACTCTTTGACAAGATGC Ll I 1 i GGAACCATTTTCTACGTCCTCTTCCTCTTGCTCTTCCTT 

25 TTTGCGACAGTCAC i i 1 1 1 CTGT CGTGATGCTGGAAATCAATGTAGACAATATCACCAACCAGGATAACAGCAAAC 

GTGCCAAATGGAGTGTTATTTTAGGAATTrTGACCTTT 

GCGGATGTTCACArrrflGGTAAGAC Crrcrri GACGCTATGGA Cl 1 CIT GGTTTCCAATCTCCTCATGCCATTTGG 
AGCTCTCTACCTTTCALi l I I l ACACGCTATATCTTTAAAAAGGCTCTTGCAATGGAGGAACTCCATCTCGATGAA 
AGAGCATGGAAACAAGGACTGTTCCAAGTCTGGCTCTTC Cl ICI I CG l i 1 Crr CGTTTCGTCATTCCAATCATCATC 
30 ATTGTGGTCTTCATTGCCCAATrrATGTAATCAj\AAAGGACTTGAGTAG 

4183.5 

ATGTTGAAAAAATGGCAGTTAAAAGATGTTATCTTGCTTG Ll 1 I C i 1 GTCTAT Crrrrri GGTGGG G ri l ' rCCTTGG 
TTCAGGATATGTGTATAATATTCTCAGTCrACTCTTAACACCTCTTGGTTTGCAGGCCTTTGCCAATGAAATC 

35 TCGCTCTCTGGTGTATGGCTGCGCCCATTGCrrGCCATCTTTGTTCCGAGAGTCGGAAGTGCAACGATtGGAGAAOT 
GCTAGCrGCGCTTGCr GAAG TCCTTrATGGTAGCCAATTTGGTCTAGGAGCT Cl I ' l 1 GTCTGG CrriO i I CAAGGIT 
TGGGAAGTGAATTTGGTTTTATCGTAACTAAGAATCGCTATGAAAGTTGGCTCTCTCTAACTGCTi^ 
G ATTACG<nTGT TAGC TTTGTCT ATG AATACATT AAGTT AGGTTACTACGC C ' l 111 CCCTTCCGTTTGTCCTTTCCTT 
GCTTGTGCTACGTTTTATTTCTGTTTA 1 I'l CI"! CTGTACCATCTTGGTTCGTGCCATTGTCAAACTCTATCATCAGTT 

40 TGCAACTGGAGGAAAACCATAG 

4183.6 

ATGCTCAAAGTAGCA ACCC AGACACCGATTATCAGTCT C [ 1 L ri GCTGATTTTATCCTTGGAAACATCTTTCATTCC 
TTCGATTGCTCTGACTCTTTCGGTaGTCGCATTTTGTATTCT 
45 CrTGGATGATCATACTTGCCATTTTACCATC 1 li 1 GCCAACTACTGGGCAGTTCAGTTACACGGAGATGCTTCACA 

ggcagtcatgcttggaacgagcgcctttgtgacagtttgtatcggccttgtctttgtttcctct 
agcttctcttgtacttggctcaaaaggggctatcacgctcttggtcctatgccttgatt 

CCTCTCATTCAGCAAGAAATCAAGTCCCTCAAAGA^GCTTGCCTATTACGTGGTCAAGAACTACAI 1 i 1 IGGTCGC 

ccttgatttacagtaaggttctgatgacagtctttaggtggcgccatctttacctgagagctctatct 

50 ATATGACGAA CATGCACAGTTGAAGAATAGCTATCGGA CI ' 1 rn AT ATTCCT AAA aaaacaaaattaatctacctg 

LI 1 riLl I i 1 1 ATTGCTTCAAACCAGTCTAi ri I I ATAA 

4183.7 

ATGAGAAAGCACCAATTACAAGTTCACAAATTAACCATTTTATCTATGATGATTGCCCTTGATGTAGTCCTTACA 
55 CTATCTTTCGAATTGAGGG>au\TGGCACCGATGTCCAGTGTAGTCAATATTCTAGCAGGAATCATGATGGGACCTGT 
TTATCCCTTGGCTATGGCTACAGTCACAGCC 111 ATCCGTATGACGACTCAAGGGATTCCGCCTTTAGCTCTCACA 
GGAGCGACTTTTGGAGCCCTTCTAGCAGGTCrCTTTTATAj\GTACGGTC 
AGATTTTGGGAACAGGTATTATTGGTTCCATTGTTTCCTATCCTGTTATC 

C TTAGC TGGTTTATCTACACGCCTCGA 11111 CGGAGCAACGTTGATTGGTACAGCGATTTCCnTATTGCCTTTCG 
60 ATTTTTAATCAACCAGGAATTCTTTAAAAAAGTGCAGGGATA l MCM 1 AGTGAAAGGATAGACTGA 

4183.8 

atgcagcaatttacaaa tccc rrrcctataggctctagttccctcattcactgcattaccaatgaga l 1 ill f gtga 
gatgctgccaaatgggattttggctctgggatgcaaacctgtcatggcagatgattcccgtgaagttcttgatt^ 
65 actaagcaaagtcaggctctcttcatcaatttggggcatttgtcagctgagaaggaaaaagcaatccgcatggca 



% % 

62 



10 



GCTTCGTATGCAAA CCAAT CTrCTCTCCCCATGGTAGTAGATGCGGTrGGCGTAACGACTTO\TCCATTCG 
GCTTAGTTAAAGACCl 1 I I AGACTATAGACCTACGGTCCTTAAAGGAAACATGTCACAAATTCGAAGTCT TGTTGG 
ATTAAAGCACCACGGCGTTGGGGTCGATGCGAGTGCTAAAGATO\AGAAACGGAGGATTTGCTTCaaGTC^ 
AGACTGGTGTCAG ACCTATCCTGGTATGT Crn C 1 I AGTCACAGGTCCCAAGGACCTCGTCGTTTCGAAAAATCAG 
GTCGCTGTACTGGGAAATGGCTGTACTGAATTaGACTGGATAACaGGGACACGAGACTTGGTTGGAGCCTTAACA 
GCT Gri i ri CTCAGCCAAGGAAAGACTG Gli 1 r GAAG Cl ' I CTl GCTTAGCAGTCTCTTATCTCAATATCGCTGCTGA 

gaaaatagttgttcaaggaatgggattggaagaatttcgttaccaagtactcaatcagcrrrcgctccti^ 
agatgaaaattggctagataccatcaaaggagaggtttatgaatag 



4185.3 

atgaaccataaaatcgcaattttatcagatgttcatggcaatgcgacggcgctagaagcagtgattgcagatgct 

15 AAAAATC AACGG GCC AGTG AATATTGGCTTCTGGG AG AT A I ' I'l T I L" 1" I'CCTGGTCC AGGCGCAAATG ACTTAGTCG 

CCCTGCTAAAGGACCTTCCTATCACAGCAAGTGTTCGACGCAATTGGGATGATCGTGTCCTTGAGGCTT^ 
GCAATATGGCTTAGAAGACCCACAGGAA GTTCA GCTCTTGCGTATGACACAGTATTTGATGGAGCGAATGGATCC 
TGCAACGATTGTCTGGCTACGAAGCTTGCCrrTGCTGGAAAAGAAAGAj\ATTGACGGATTGCGCri MCTATCTCT 
CATAATTTACCTGACAj\AAACTATGGTGGTGACTTGCTAGTTGAGAATGATACACAGAAATTTGACCAACTGCT 

20 GATGCGGAAACGGACGTGGCAGTTTATGCTCATGTTCACAAGCAGTTGCTTCGTTATGGAAGTCAAGGGC>^ 

ATCATCAATCCAGGCTCGATTGGCATGCCCTATTTTAATTGGGAGGCGTTAAAAAATCACCGTTCCCAGTATGCCG 

tgatagaagttgaagatggggaattactcaatatccaarrrcgtaaagttgcttatgattacgaagctgagttaga 
attggccaagtccaaggggcttccctttatcgaaatgtatgaagaactgcgtcgtgacgataactatcaggggca 

CAATCTGGAATTATTAGCCAGCTTAATAGAAAAGCATGGGTATGTAGAGGATGTGAAGAA l - i rrriU GA l l 1 i I I G 
25 TAA 

/ 

4186.1 

ATGAATGTAAATCAGATTGTACGGATTATTCCTACTTTAAAAGCTAATAATAGAAAATTAAATGAAACATT^ 
TTGAA ACCC TTGGAATGAAGGCCl lOI 1 AGAAGAATCGGCCTTTCTGTCACTAGGTGACCAAACGGGTCTTGAAAA 

30 GCTGGTTTTAGAAGAAGCrCCCAGTATGCGTACTCGTAAGGTAGAGGGAAGAAAAAAACTAGCTAGATTGATTCT 
CAAGGTGGAAA ATCCC TTA GAAAT TGAAGGAATCTTATCTAAAACAGATTCGATTCATCGATTATATAAAGGTC^ 
AAATGGCTACGCrrrTGAAATTTTCTCACCAGAAGATGATTTGATTTTGATTCATGCGGAAGATGACATAGCAAGT 
CTAGTAGAAGTAGGAGAAAAGCCTGAATTTCAAACAGATTTGGCATCAA l IILH 1 AAGTAAATTTGAG ATTTCTA 
TGGAATTACATCTCCCAACTGATATCG AAAG 1 1 i H I GGAATCATCTGAAATTGGGGCATCCCTTGATnTATTCC 

35 AGCTCAGGGGCAGGATTTGACTGTGGACAATACGGTTACCTGGGACTTATCrATGCTCAAGTTCTTCGTCAATGAA 
TTAGACATAGCAAGTCTTCGCCAGAAGTTTGAGTCTACTGAATA l 1 11 ATTCCTAAGTCTGAAAAATTCTTCCTTG 
GTAAAGATAGAAATAATGTTGAATTGTGGTTTGAAGAAGTATGA 

4186.2 

40 atgaagtggaccaagattattaaaaaaatagaagaacaaatcgaggcagggatttatcccggagcct li 1 1 ig cg 
tattttaaggacaatcaatggacagagttctatttaggccagagtgacccagagcatggcttgcagactgaggca 
ggactagtttatgacctagctagtgtcagcaaggttgttggggttggcacagtttgtac c i ' l cl ' l gtgggaaatag 

GTCAATTAGATATTGATAGACTGGTA ATAG AI 1 1 n I ACCTGAGAGTGATTATCCAGACATCACTATTCGCCAGCT 
CrrGACTCATGCAACAGACCTTGATCCri I'l ATTCCTAATCGTCATCTITI AACAGCCCCTGAATTAAAGGAAGCG 
45 ATGTTTCA TCTCAA CAGACGAAGTCAGCCAG C LI ri Li ' I l ATTCGGATGTCCA li ' i ' l 1 I GCTGTTGGGCTTTATTTT 

ggaaagaatttrtaatcaagatttggatgtgatmaaaggatcaagtctggaaaccttggggaatgac^ 
taagtttgggccagttgagcttgctgttccaacagttagaggtgtagaggcagccatagtgcatgatcccaaggc 
tcgtctcctgggta gaca tgctgggagtgctggtttattttcgactataaaggatttacaaat li i 1 1 1 agaacac 
TATTTAGCAGATGATTTTGCAAGAGACTTAAATCAAAAI 1 I i I ctcctttggatgacaaggaacgttctttagcat 

50 ggaatrrgg aagg agattggctagaccatacgggctatacaggtacctttatcatgtcgaatcgtcagaagcaag 
aagccactattttcctatcgaatcgtacctatgaaaaggacgagagagctcaatggatattagaccgcaatcaag 

TGATGAACTTGATTCCCAAAGAAGAGTAA 
4187.2 

55 ATGATGAAGAAGACTTATAATCATATTTTGCTCTGGGGAGTCA l 1 1 1 CTATAGCATTTGCATTGTCTG I ' 1 ' I ' l ' l GCTT 

TACT CCTC AAGAACAATCTACCGTGGGAGTGGGAACTCCAGGTATTCAGCATCTTGGACGCCTGG l 11 MLl 1 1 I G 
ACTCCnTTCAATTCTCT CTCGAA ACTGGGCGAAGTGAGTGACATTGGACAArrATGTTGGA rrri 1 1 l ACA^ 
TCCTCA ATGTCT T CTTG i 1 Tl I'l CCTCTGATTITCCAACTCCTTTATCTATTTCCAAATTTGCGGAAAACAAAAAAG 
GTCCT TCTTT TTAGrrL I'CrrGTGAGTCTrGGAATCGAGTGTACGCAATTAATCTTGGA Cri 1 1 ICi 1 I GATTTCAAT 

60 CGCGTCTTTGAGATTGATGATTTGTGGACCAACACITTGGGTGGCTATCTGGCTTGGCTCCTTTATAAACGATT^ 
ATAAAAACAAGGTAAGGAATTAA 

65 
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4!88.1 

ATGAAGATTCCTCTCT7AA C1 1 I ' l GCAAGGCATAAA I ' l l 0 1 Tl ATCTCTTGCTTACTTTGCri'l 1 ILi iGCnTGCnT 
TATCGTGATGTTTTGATGACTTA I i IH U i 1 i GATATTCATGCGCCCGATCTAGCTA.A-^TTCGATGGACAACCAAT 
TAAAAATGACTTATTAAAATCAGCATTAGATTTTCGTATTCTCCAGTTCAATCTAG^^ 
5 TTCCAATCATCATTGTTTTCCTAGGTTTTCAATATATTGAG 

AGAAGTGAGTTATCAAGGGTTAAAAAGAAAGTTGACTTTGCAAGTTGCAAGTATCCCTTGTT^ 
ACTGTGCTGATAATrGCAATTATAACCTA l llLl 1 I GGGA H 1 1 1 I CTCCT CTTGG ATGGAATTCTCTATTTrCTGAT 
GGAAGTGGTTTACAAAGACTCCTAGATGGAGAGATAAAAAGCTAri tO i'l C M i ACTTGTGTCCTACTAATC GGTA 
TTTTCATCAATGCAATCTATTTTITACAAATAGTTGATTATGTGGGGAATGTGACTCGTTCGGCAA TC 
10 ATGTTTCTTTGGCTTGGTTCTATGCTGCTTrATAGTGCCTTGCCrrACTATATGGT^ 

AGCTAGCTATGCGGATG TAAG TTrGATGAAACTCTTTACTCCTTATATCCrrrTATATTGT 
AAAAATATGAAGATAATGTTTAA 

4188.2 ____ 
15 ATGAAGATAATCrrTAAGAATTITAACAATATTTTGCTAAATAGAAAGATTGTTTTA CT 

GATGATTTTGATAAACCATCTATTGTCAACAGCGCTTCAAAAGCAGGATGCTGTTATCTTTTTC ^ 
ATrrCAATTTTTTCCTATAATGACTATTCTGAAGCGAATTTAGAAATCCCCAAACTATTGTT^ 
CATCGTAGGATGGCTCTCTGTCATTrrACTTGAAAGTGATTTGGCAGACCATTACCATCAC^ 
TCAAGCTCCTTTTTCGATTATACAAGGAAACGATTGGTTGTCATTTCTAAATTTT^ 
20 G li iCl iG GTTTACrrCCTCTAGGAATTCATTTCAAAACAGTCGCA Cri ' i ' lCl ' ri ' i 1 ACTTGCTCA GTrAAT G ATGTT 

GTACTTACTACTGTCTTATCTGATAGCACTGATTAGTGCGGGCGCTGG n li i C Ll I 1 1 1 r CTCTATTTTTTAGCATT 
TGTCGGACAAGAATGGATGATGGATCATATTCTAACAGTGTATTTAGTACTCTTAAGTTTATTAGTTATGT^ 
GTTAGTCGCTTGGAAGAGAAATTTAAGAAAGGATAA 

25 4188.5 

ATGGGCA>u\GGAGAGATGGGCAAAGGAGTTATTGGCTTGGAGTTCGACTCAGAAGTATTGGTCAACAAGGCTCCA 
ACCGTrCAATTGGCAAATGGTAAAACAGCGACrrTCCTAACCCAGTATGATAGCAAGACCTTG^ 
ATAAGGAAGATATCGGACAGGAAATTATTGGTATAGCTAAAGGAAGCATCGAAAGTATGCATAATCTTCCTGTAA 
ATCTAGCAGGTGCCAGAGTTCCTGGCGGAGTAAATGGTAGCAAAGCAGCGGTGCATGAAGTrCCAGAArrTACAG 
30 GGGGAGTTAATGGTACAGAGCCAGCTCrrCATGAAATCGCAGAGTATAAGGGATCTGATTCGCrrCTAACTCTTAC 
TACAAAAAAAGATTATACTTACAAACCTCCTCTTGCTCAGCAGGCACTTCCTGAAACACGAAA^ 
CCTCCTAGCTTCACTAGGACTAACAG Cl i iLl i CCTTGGTCTGTTrACGCTAGGGAAAAAGAGAGAACAATAA 

4188.10 

35 atgtttaaagttttacaaaaagttggaaaag ll 1 1 i atgttacctatagctatacttcctgcagc aggtct aci i i i 
ggggattggtggtgcactttcaaacccaaccacgatagcaacttatccaatactagacaatagtatttttcaa 
atattccaagtaatgagctctgcacgagaggttgtattcagtaatttgtcactacttctctgtgtgggattatgta 

TTGGCTTAGCGAj^ACGAGATAAAGGAACCGCTCCGTTAGCAGGAGTAACTGGTTACTTAGTTATGACTGCAACGA 
TCAAAGCTTTGGTAAAACnTTTTATGGCAGAAGGATCTGCAATTGATACTGGAGTTATTGGAG 
40 AATAGTTGCCGTATATTTGCACAACCGATATAACAATATTCAATTACCTrCCGCTTTAGGATTCTTrG GAGGT TC^ 
CGCrrCGTTCCTATTGTT ACATCGTTCTCTTCT ATCTTG ATTGGCrTTGTC 1" I C r r 1 GTTATTTGGC CACCTT TCC AA 
CAA Cl ICl lOl 1 i CTACAGGTGGATATATrrCTCAGGCGGGTCCAATTGGAACTTTTCTATATGGAri 1 i lAATGAG 
ACTTTCTGGAGCAGTAGGCTTACATCATATAATTTACCCTATGTTTTGGTATACTGAACTTGGTGGTGT^ 
TTGCAGGACAAACACTGGTTGGAGCTCAAAAAATA n I ' l ' l ' l GCTCAATTAGCCGATTTGGCCCATTCTGGATTATT 

45 tacag aagg aac aagg i n r m gcacgtcgtttctc aacaatg atgttcggtttaccggctgcctg tttagc gatg 
taccatagtcttcctaaaaatcgtcgtaaaaaatacgcgggtttgttttttcgagttgctttaacatc^^ 

CGGTATTACAGAACCAArrGAArTTATGTTTCTATTCGTCAGTCCGCTTCTATATGrrGTTCACGCATTCCT^ 
GTGTTAG Cl ICH 1 ATTGCAGACGTC^TAAATA^T^CAATAGGAAACACATTTTCAGGAGGTGTAATCGATT TCACT 
TTATTrGGAA rri lGCAGGGGAACGCTAAGACGAATTGGGTTCTTCAGATTCCATTTGGACTTATTTGGAGTu i i 1 1 
50 GTATTATATTA r 1 I ' ll AGATGGTTCATTACTCAATTCAACGTTCTAACGCCAGGGCGAGGAGAAG AAGTAGATTCT 

aaagaaatttctgaatccgcagattcaacrrcaaatactgcagattatttaaaacaggatagcctacaaat^ 
gagccttgggtggatcaaataatatagaagatgtagatgcttctgtgacacgtttacctgtagctgtaaaaga.ag 
ttaatcaagttgataaagcacttttaaaacaaattggtgcagttgatctct^ 
aatctatggagcaaaagcaatcttatataaaaatagtattaatgaaattttaggtgtagatgattaa 

55 

4188.11 

atgaaatttagaaaj\ttagcttgtacagtacttgcgggtgctgcggttcttggtcttgctgc^ 
gcggaagtaaagatcctgccaaatcaggtgctcacggtgccaaaacagaaatcacttggtgggcattcccagtat 
ttacccaagaaaaaactggtgacggtgttggaacttatgaaaaatcaatcatcgaagcctttgaaaaagcaaac^ 
60 cagatataaaagtgaaattggaaaccatcgacttcaagtcaggtcctgaaaaaatcacaacagccatcgaagcag 
gaacagctccagacgtactctttgatgcaccaggacgtatcatccaatacggtaaaaacggtaaattggctgagt 
tgaatgacctcttcacagatgaatttgttaaagatgtcaacaatgaaaacatcgtacaagcaagtaaagctggag 
acaaggcttatatgtatccgattagttctgccccattctacatggcaatgaacaagaaaatgttacaagatgctgg 

AGTAGCAAj\CCTTGTAAAAGAAGGTTGGACAACTGATGATTTTGAAAAAGTATTGAAAGCACTTAAA 

65 ttacacaccaggttcattgttcagttctggtcaagggggagaccaacgaacacgtgcctttatctctaacctttat 
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agccgttctgtaacagatgaaaaagttagcaaatatacaactgatgatcctaaattcgtcaaagctcttga aaaa 
gcaactagctcgattaaagacaatttcatcaataatggttcacaatttgacgctgggg^ 
gco^acggtcaaacatcttacacaatcctttgggcaccagctcaaaatggtatcc^ 
agtaaggtagaagtggtagaagtaccattcccatcagacgaaggtaagccagctcttgagtaccrtgtaaacggg 

5 tttgcagtattcaacaataaagacgacaagaaagtcgctgcatctaagaaattcatccagt^ 

aaggagtggggacctaaagacgtagttcgtacaggtgctttcccagtccgtactrcatttggaaaacm 

ACAAACGCATGGAj^ACAATCAGCGGCTGGACTCAA^ 

AAATGAGAACACr II GGTTCCCAATGTTGCAATCrCTATCAAATGGTGACG AJ^AAACCAGCAGATGCTT^GAAAG 
CCTTCACTGAAAAAGCGAACGAAACAATCAAAAAAGCTATGAAACAATAG 

10 

4188.12 

ATGCAATCTACAGAAAAAAAACCATTAACAGCCTTTACrCTTATTTCAACAATa\TTTTGCT 
TCTTCATCTTTCCATTCTACTGGATTTTGACAGGGGCATTCAAATCACAACCTGATACAATTCTT^ 
TGCTTCCCTAAAATGCCAACCATGGAAAACTTCCAACAACTCATGGTGCAGAACCCTGCCTTGCy^ 
15 AACrCAGTATTTATCTCATTCGTAACCATGTTCTTAGTTT 

AAAACCTTTCTATGGTCAACGCATTCrATTTGCTATCTTTATCGCTGCTATGGCCCTTCCA 

TACCATTGGTACGTATCGTCAACTTCATGGGAATCCATGATACTCTCTGGGCAGTTATCTTGCCTTTGATTGGAT^ 
GCCATTCGGTGTCTTCCTCATGAAACAGTTCAGTGAAAATATCCCTACAGAGTTGCTTGAATCAGCTAA^ 
GGTTGTGGTGAGATTCGTACCTTCTGGAGTGTAGCCTTCCCGArrGTGAj\ACCACGGTTTGCAGCCCTTGC^ 
20 TTACCTTCATC AATACTTGG AATG ACTACTTCATGCAATTCGT AATGTTG ACTTCAC^ 1 1 1 0 ACCATCTCA 

CTTGGGGTTGCGACCATGCAGGCTGAAATGGCAACCAACTATGGTTTGATTATGGCAGGAGCTGCCCTTCCT 
TTCCAATCGTCACAGTCnTCCTAGTCTTCCAAAAATCCTTCACACAGGGTATTACTATGCGAGCGGTCAAAGGATA 
A 

25 4191.1 

ATGAAAAAAACl 111 llCl I ACTGGTGTTAGGCnU 11 riGC Cl ICU CCACTCTCTG rri IT GCCATTGATTTCAAG 
ATAAACrCTTATTAAGGGGATTTGTATATTCATGCAGACAATACGG^ 

TTGAGGAGGACTTTAAGGGCCAAATCGTGGGACTTGGACCTGCTGGTAAGATGCCTAGCGGGTTTGACATTGACC 
CTCATCCAAAGATTCAGGCCGCGAAAAACGGTGCAGAACTAGCAGATGTGACTAGCGAAGTAACAGAAGAAGCG 

30 GATGGTTAT ACTGT G AGAG TCTATAATCCAGGTCAGGAGGGCGACATAGTTGAAGTTGACCTCGTCrGGAACTTA 
AAAAATTT AC 1 ' i i l CCTTTATGATG AT ATCGCTG AATTAAATTGGCAACCTCTG ACAG ATAGTTCAG A GTCT ATTG 
AAAAGTTTGAATTTCATCTAAGGGGAGACAACGGGGCTGAAAAACTCT 1 1 IICCATAC AGGG AAA Cn "11 1 AGAG 
AGGGAACGATTGAAAAGAGTAACCTTGATTATACTATCCGTTTAGACAATCTTCCGGCTAAGCGTGGAGTTGAG^ 
GCATCCCTATTGGCCTCGGACCGATTTTGCTAGCGCTAGGGATCAGGGATTGAAAGGGAATCGTrrAGAAGAGTTT 

35 AATAAGATAGAAGACTCGATTGTTAGAGAAAAAGATCAGAGTAAACAACTCGTTACTTGGGTCCTCCCTTCGATC 
CTTTCCATCTCCTTGTTATTGAGTGTCTGCTTCTAl'ri I ATTTATAGAAGAAAGACCACTCCTTCAGTCAAATATGC 
CAAAAATCATCGTCTCTATGAACCACCAATGGAATTAGAGCCTATGGTTTTATCAGAAGCAGTCTACTCGACCTCC 
TTGGaGGAAGTGAGTCCCTTGGTCAAGGGAGCTGG AAAATTCACL I I 1 GATCAACTTATTCAAGCTACCTTGCTAG 
ATGTGATAGACCGTGGGAATGTCTCTATCATTTCAGAAGGAGATGCAGTTGGTTTGAGGCTAGTAAAAGAAGATG 

40 GTTTGTCAAGCTTTGAGAAAGACTGCCTAAATCTAGLl 1 I 1 1 CAGGTAAAAAAGAAGAAACTCTTTCCAA ri'lGTT 

TGCGGATTACAAGGTA TCTGAT AGTCTTTATCGTAGAGCCAAAGTTTCTGATGAAAAACGGATTCAAGCAAGAGG 
GCTTCAACTCAAATL'l ICl 11 IGAAGAGGTATTGAACCAGATGCAAGAAGGAGTGAGAAAACGAGTTTCCTTCTGG 
GGGCTCCCAGATTATTATCCTCCTTTAACTGGTGGGGAAAAGGCCTTGCAAGTGCGTATGGGTGCCTTGACTA 
TGCCC CTAT TTATCGGATT TGGT TTGTTCTTGTACAGTTTAGACGTTCATGGCTATCTT^ 

45 CTTGGTTTTCTAGGGTTAGTTTTCTCrGl 1 1 1 CT ATT ATTGGAAGCTTCG ACT AG AT AATCGTGATGGTGTTCT AAA 

TGAAGCGGGAGCTGAGGTCTACTATCTCTGGACCAGrrTTGAAAATATGTTGCGTGAGATTGCACGATTGGATCAG 
GCrGAACTGGAAAGTATTGTGGTCTGGAATCGCCTCTTGGTCTATGCGACCTTATTTGGCTATGCGGACAAGC^ 
GTCA TTTG ATGAAGGTTCATCAGATTCAj\GTGGAj\AATCCaGATATCAj\TCTCTATGTAGCTTATGGCTGGCACAG 
TACGTrrTATCATTCAACAGCACAAATGAGCCATTATGCTAGTGTCGCAAATACAGCAAGCACCTACTCrrGTATCT 

50 TCTGGAAGTGGAAGTTCTGGTGGTGGCTTCTCTGGAGGCGGAGGTGGCGGCAGTATCGGTGCCTTTTAA 

419L2 

ATGAAAAA AGTAA GAAAGATAnTCAGAAGGCAGTTGCAGGACTGTGCTGTATATCTCAGTTGACAG Ct 1 lliCi 1 
CGATAGTTGCTTTAGCACAAACGCC TGAA ACCAGTCCAGCGATAGGAAAAGTAGTGATTAAGGAGACAGGCGAAG 

55 GAGGAGCGCTTCTAGGAGATGCC GTCTT TGAGTTGAAAAACAATACGGATGGCACAACTGTTTCGCAAAGGACAG 
AGGCGCAAACAGGAGAAGCGATArrTTCAAACATAAAACCTGGGACATACACCTTGACAGAACCCCAACCTCCAG 
rrGGTTATAAACCCTCTACTAAACAATGGACTGTTGAAGTTGAGAAGAATCGTCGGACGACTGTCCAAGGTGAAC 
AGGTAGAAAATCGAGAAGAGGCTCTATCTGACCAGTATCCACAAACAGGGACTTATCCAGATGTTCAAACACCTT 
ATCAGATTATTAAGGTA GATG GTTCGGAAAAAAACGGACAGCACAAGGCGTTGAATCCGAATCCATATGAACGTG 

60 TGATTCCAGAAGGTACACTTrCAAACAGAATTTATCAAGTGAATAATTTGGATGATAACCAATATGCAATCGAATT 
GACGGTTAGTGGGAAAACACTGTATGAACAAAAAGATAAGTCTGTGCCGCTGGATCTCGTTATCTTGCTCGATAA 
CTCAAATAGTATGAGTAACATTCGAAACAAGAATGCTCGACGTGCGGAAAGAGCTGGTGAGGCGACACGTTCTCT 
TATTGATAAAATTACATCTGATrCAGAAAATAGGGTAGCGCrTGTGACrTATGCTTCCACTATCT^ 
GAGTTTAC AGTA GAAAAAGCGGTACCAGATAAAAACGGAAAGCGATTGAATGATTCT C ' rriTri GGAATTATGAT 

65 CAGACGAGTTTTACAACCAATACCAAAGATTATAGTTATTTAAAGCrGACTAATGATAAGAATGACATTGTAGAAT 
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TAAAAAATAAGGTACCTACCGACGC^GA.^GACCATGATGGAAATAGATTGATCTACCAATTCCGTGCCA Crri' lA 
CTCAGAAAGCTTTGATGAAGGCAGATGAGATTrTGACACAACAAGCGAGACAAAATACTCAAAAAGTCATT^ 
ATATTACGGATGGTGTCCCAACTATGTCGTATCCGATTAATTTTAATCATGCTAC Cii i i G CTCCATCATATCAAAAT 
C AACT AAATGCA I" I'M i I AGTAAATCTCCT AAT AAAG ATGG AATACT ATT AAGTC ATTTTATTACGCA-^GCAACTA 
5 GTGGAGAACATA CAAT TGTACGCGGAGATGGGCAAAGTTACCAGATCTTTACAGATAAGACAGTTTATGAAAAAC 
GTGCTCCTGCAGCTrTCCCAGrrAAACCTGAAAAATATTCTGAAATGAACGCGGCTGG^ 
TCCAATTAATGGTGGATATATTTGGCrTAATTGGAGAGAGAGTATTCTGGCITATCCGTTTAATTCT^^ 
AAATrACCAATCATGGTGACCCTACAAGATGGTACTATAACCGGAATATTGCTCCTGATGGGTATGATCTCTTTAC 
GCTAGGTATrGCTATrAACGGAGATCCTGGTACGGATGAAGCAACGGCTACTAGTTTTATGCAAAGTATTTCTAG^ 

10 AAACCTGAAAACTATACCAATGTTACTGACACGACAAAAATATTGGAACAGTrGAATCGTTATTTCCACACCATC 
GTAACTGAAAAGAAATCAATTGACAATGGTACGATTACAGATCCGATGGGTGAGTTAATTGATTTGCAATTGGGC 
ACAGATGGAAGATTTGATCC^GCAGATTACACrrrAACTGCAAACGATGGTAGTCGCTTGGAGAATGGACAAGCT 
GTAGGTGGTCCACAAAATGATGGTGGrrTGTTAAAAAATGCAAAAGTGCTCTATGATACGACTGACAAAAGGATT 
CGTGTAACAGG TCTG TACCrrGGAACGGATGAAAAAGTTACCTTGACCTACAATGTTCGTTrGAATGATGAGT^ 

15 TAAGCAATAAATTTTATGATACCAATGGTCGAACAACCTTACATCCTAAGGAAGTAGAACAGAACACAGTGCGCG 
ACTTCCCGATTCCTAAGATTCGTGATGTGCGGAAGTATCCAGAAATCACAATTTCAAAAGAGAAAAAACnTGGT^ 
ACATrGAGTTTATTAAGGTCAATAAAAATG ATAAAAAACCACTGAGAGGTGCG G I C ' I ' l T AGTCTTCAAAAACAAC 

atccggatta tcca gatatttatggagctattgatcaaaatggcacttatcaaaatgtgagaacaggtgaagatg 
gtaagttgacctttaaaaatctgtcagatgggaaatatcgattatttgaaaattcrgaaccagctgcttat 
20 cgttcaaaataagcctatcg i'l occttccaaatagtaaatgg ag aagtcagagatgtgacttcaatcgttccacaa 
gatataccagcgggttacgagtttacgaatgataagcactatattaccaatgaacctattcctccaaagagagaa 
tatcctcgaactcgtggtatcggaatgttgccattctatctgataggttgcatgatgatgggaggagtrctattat 

ACACACGGAAACATCCGTAA 

25 

4191.3 

ATGAAATCAATCAACAAAl i I 1 1 AACAATGCTTGCTGCCTTATTACTGACAGCGAGTAGCCTG'ri rrCAGCTGCAA 
CAGlTi TIGCCGCTGGGACGACAACAACATCTGTTACCGTTCATAAACTATTGGCAACAGATGGCGATATGGATA 
30 AAj\TTGCAAATGAGTTAGAj\ACAGGTAACTATGCTGGTAATAAAGTGGGTGTTCTACCTGCAAATGCAAAAGAAA 

ttgccggtgttatgtrcgtttggacaaatactaataatgaaattarrgatgaaaatggccaaactcraggagtgaa 
tattgatccacaaacattraaactctcaggggcaatgccggcaactgcaatgaj\aaaattaacagaagctgaacg 
agcraaatttaacacggcaaatrtaccagctgctaagtataaaatrratgaaattcacagtttatc^ 
ggtgaagatggagcaaccttaacaggttctaaagcagttccaattgaaattgaattaccattgaacgatgttgtg 
35 gatgcgcatgtgtatccaaaaj^tacagaagcaaagccaaaaj^ttgataaagatttcaaaggtaaagcaaatcca 
gatacaccacgtgtagataaagatacacctgtgaaccaccaagttggagatgttgtagagtacgaaattgttaca 

AAAATTCCAGCACTTGCTAATrATGCAACAGCAAACTGGAGCGATAGAATGACTGAAGGTTTGGCATTCAACAA^ 

ggtacagtgaaagtaactgttgatgatgrrgcacttgaagcaggtgattatgctctaacagaagtagcaactggtt 
ttgatttgaaattaacagatgctggrttagctaaagtgaatgaccaaaacgctgaaaaaactgtgaaaatcac^ 

40 attcggcaacattgaatgacaaagcaattgtagaagtaccagaatctaatgatgtaacatttaactatggtaata 
atccagatcacgggaatactccaaagccgaataagccaaatgaaaacggcgatttgacattgaccaagacatggg 
trgatgctacaggt gcac caattccggctggagctgaagcaacgttcgatttggttaatgctcagactggtaaagt 
tgtacaaactgtaactttgacaacagacaaaaatacagttactgttaacggattggataaaaatacagaatataa 
attcgttgaacctagtataaaagggtattcagcagattatcaagaaatcactacagctggagaaattgctgtcaa 

45 gaactggaaagacgaaaatccaaaaccacttgatccaacagagccaaaagttgttacatatggtaaaaagtttgt 
caaagtraatgataaagataatcctrragctggggcagaatttgtaattgcaaatgctgataatgctggtcaatat 

TTAGCACGTAAAGCAGATAAAGTGAGTCAAGAAGAGAAGCAGTTGGTTGTTACAACAAAGGATGCTTTAGATAGA 
GCAGTTGCTGCTTATAACGCrCTTACTGCACAACAACAAACTCAGCAAGAAAAAGAGAAAGTTGACA>^ 
GCTGCTTATAATGCTGCTGTGATTGCTGCCAACAATGCATTTGAATGGGTGGCAGATAAGGACAATGAAAATGTTC 
50 TGAAATTAGTTTCTGATGCACAAGGTCGCTTTGAAArrACAGGCCI ICl I GCAGGTACATATTACTTAGAAGAAAC 

AAAACAGCCTGCTGGTT ATGC ATTACTA ACTAGCCGTC AG AAATTTG A AGTCACTGCA A C I ' i C i I ATTCAGCG ACT 
^ GGACAAGGCATTGAGTATACTGCTGGTTCAGGTAAAGATGACGCTACAAAAGTAGTCAACAAAAAAATCACTATC 
CCACAAACGGGTGGTATTGGTACAATTATCTTTGCTGTAGCGGGGGCTGCGATTATGGGTATTGCAGTGTACGCAT 
ATGTTAAAAACAACAAAGATGAGGATCAACTTGCTTAA 

4191.4 

ATGACAATGCAGAAAATGCAGAAAATGATTAGTCGTAT Cri Cn i GTTATGGCTCTGT G Tl ' lTI CTCrTGTATGGGC 
TGCACATGCAGTCCAAGCGCAAGAAGATCACACGTTGGTCTTGCAATTGGAGAACTATCAGGAGGTGGTTAGTCA 
ATTGCCATCTCCTGATGGTCATCGGTTGCAAGTATGGAAGTTGGATGATTCGTATTCCTATGATGATCGGGTGCAA 

60 ATTGTAAGAGACTTGCATTCGTGGGATGACaaTAAA C T I ICl ' lCn I CAAAAAGACTTCGTTTGAGATGACCTTCC 

TTGAGAATCAGATTGAAG TATCTCA TATTCCAAATGGTCTTTACTATGTTCGCTCTATTATCCAGACGGATGCGGT 
TrcrrATCCAGCTGAAlUUCriUM'l'GAAATCACAGATCAAACGGTAGAGCCTTTGGTCATTGTAGCGAAAAAAACA 
GATACAATGACAACAAAGGTGAAGCTGATAAAGGTGGATCAAGACCACAATCGCTTGGAGGGTGTCGGCTTTAAA 
TrGGTATCAGTAGCAAGAGATGTrrCTGAAAAAGAGGTTCCarTGATTGGAGAATACCGTTACAGrrCnTCTGCT^ 

65 AAGTAGGGAGAACTCTCTATACTGATAAAAATGGAGAGATlTTTGTGACAAATCTTCCTCTrGGGAACTATCG^ 
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caaggagctggagccactggcaggctatgctcntacgacgctggataccgatgtccagctcctacatcatcagct 
ggtgacgattacggttgtcaatcagaaattaccacgtggcaatcttcactttatgaaggtggatggtcggaccaa 
tacctctcttcaaggggcaatgttcaaagtcatgaaagaagaaagcggacactatacrcctgttcttcaa^ 
aaggaagtagttgtaacatcagggaaagatggtcgtttccgagtggaaggtctagagtatgggacatactattta 

5 TGGGAGCTCCAj^GCTCCAACTGGTTATGTTCAATTAACATCGCCTGTTTCCTTTACAATCGGGAA^ 

AGGAACTGGTAj\CAGTGCTTAAAAATAACAAGCGACCACGGATTGATGTGCCAGATACAGGGGAAGAAACCCTTG 
TATATCTTGATGCTTGTTGCCATTTTGTTGTTTGGTAG 

4191.5 

10 ATGAGCCACATATACTTATCTATTTTCACAAGTCTCTTGCTGATGCTAGGACXrGTCAATC^ 

AATATTrACGCATCGCTATGGAAGCAGCATATGCTCCCTTTAACTGGACCCAGGATGATGATAGCAACGGAGCTG 
TCAAAATCG ATGG GACCAATCAGTATGCCAACGGATACGATGTTCAAATCGCCAAGAAAATCGCTAAGCACTTAG 
GTAAAGAACCTTTGGTTGTTAAAACCAAGTGGGAAGGTCTAGTCCCTGCCCTTACTTCTGGTAAGA^^ 
T ATCGC AGGTATGAGTCCAACTGCAGAACGCAAACAAGAAATTGCCTTTTCGAGCAGTTACTATACTAGCGAACC 

1 D AGTTTTGCTrGTC AjWu\AGiVn-CTGCCTACGCA^ 

TCTCAAO\AGGGGTCTACCTTTATAACrrGATTGCACAAATCCCAGGTGCTAAAAAAGAAACA^ 
TTCGCTCAAATGCGACAAGCTCTTGAGGCTGGTGTCATTGATGCTTATGTTTCTGAACGTCCAGAAGCACTCACTG 
CTGAAGCTGCGAACTCTAAGTTCAAGATGATTCAAGTAGAACCTGGTTTCAAAACTGGGGAAGAAGATACAGCT 
TCGCTATCGGGCrrCGTAAAAATGACAATCGTArrAGCCAAATCAj\TGCCAGCATTGAAACCATTTCAAAAGATC 
2U ACC AAGTT GCCTTGATGGATCGTATGATCAAGGAACAACCTGCCGAAGCTACAACAACTGAAGAGACTAGCAGTA 
GTTTCTTTAGCCAAGTTGCTAAAATTCTTTCTGAAJ^ACTGGCAACA^ 

ATCTCTATCGTCGGAACCATCATAGGTCTCATTATTGGACTTGCCATTCGTGTCTTCCGTACTGCTCCrCTCTCTGA 
AAACAAAGTCATTTACGGCCTACA^^AAACTAGTCGGCTGGGTTCTCAATGTCTACATTGAAATTT^ 
CCAATGATTGTTCAATCGATGGTTATCTACTATGGAACTGCCCAAGCTTTCGGGATCAACC 
25 TTGACCGTACACTGGCTGCTATCTTCATCGTTTCAATCAATACCGGTGCCTACATGACTGAAATCGTCCGTGGTGG 
TATCCTAGCAGTTGACAAGGGACAATTTGAAGCTGCGACTGCTCTTGGTATGACCCATAACCAGACCATCCGTAA 
GATTGTCCTACCTCAGGTAGTCCGTAACATCCTACCTGCAACTGGTAATGAATTTGTCATCAATATCAAAGATACA 
TCTGTATTGAACGTTATCTCTGTTGTCGAACTTTATTTCTCAGGA^ 

CCAGACArTTACAATCATCGCCGTGATTTACrrrCTCCTCACCrTCACCGTAACACGTATCCTA 
GCAGAATGGACATGGATACCTACACTACAGGTGCTAACCAAATGCAAACGGAGGATTTGAAATAA 

4191.6 

ATGACACAAGCAATCCTTGAAATTAAACACCTCAAAAAATCCTATGGACAAAACGAAGTGCTAAAAGACATTTCA 
CTCACrGTCCACAAGGGAGAGGTCATCTCTATCATCGGAAGCTCTGGAAGCGGAAAATCGACCTTCCTACGCTCC 

i3 ATTAACCTACTTGAAACACCAACTGATGGACAAATCCTTTATCATGGACAAAACGTCCTCGAAAAAGGCTATGAC 
CTCACGCAATACCGTGAAAAGTTGGGGATGGTTTTCCAATCCTTTAACCTCTTTGAAAATCTCAAT 
ACACAATCGTCGCTCAGACAACTGTCCTAAAACGCGAACGCACAGAAGCTGAAAAGATTGCCAAAGAAAACCTG 
GAAAAGGTCGGCATGGGAGAACGCTACTGGCAAGCCAAACCAAAACAACTCTCAGGTGGTCAAAAACAACGTGT 
CGCCATCGCTCCTGCCCTCTCCATGAATCCGGACGCTATTCTCTTTGATGAACCAACATCAGCTCTCGATCCAGAA 

W ATGGTTCGAGAAGTCCTCAAAATCATGCAGGACCTGGCTCAGGAAGGCTTGACCATGATTGTCCTAACCCATGAA 
ATGGAATTTGCCCGTGATGTCTCTCACCGTGTTATCTTTATGGATAAGGGCGTGATCGCTGAAGAAGGTAAACCAG 
AAGACCTCTTCACCAATCCTAAAGAAGACCGAACAAAAGAGTTCCTTCAACGCTATCTCAAATAA 

4192.3 

45 ATGAAAAAGTATCAACTTCTATTCAAAATAAGTGCAGTCTTCTCTTACTTATTTT^ 

GA CGCTTA TCGTCCAAAACTATTGGCAATTTTCTTCTCAGATAGGCAATTTATTCTGGATTCAAAA 

TAci I t 1 lA TTGGAGTC ATGA TTGTGGTTCTTGTTAAGACAGGCCATGGTTATCTCTTCCGCATrCCAAGAAAAAA 

ATGGCnTrGGTATTCGATTTTGACAGTATTAGTGCTAGTCTTCCAGATCTCTTTTAACGTTC 

TTCAGTCAACTGCGGAAGGTTGCGCTGTATTGATTGGTTATAGTGGGACTAACTTrGCAGAGCT^ 

DU AGCCCTGTTCTTTCTGGTTCCACTGATGGAAGAATTGATTTATAGAGGATTACTGCAACATGCrrrC^ 
CGCGATTTGG TCTT GATTTGCTTCTTCCTTCTArrTTATTTGCTCTCCCT^ 
ATATCTTCGTCTTTGCAACAGTTGGAATCATCTTTCCTGGTrrGACCCGCTATACCAAGAGC 
GCGGTGCATGTGATCAATAATATTGTAGCGACCTTCCCGTTTTTCCTCACrTTTCTACAT 

55 4193.1 

ATGAACAAGAAACAATCGCTAGGTCTTGGCCTAGTTGCAGTGGCAGCAGTTGGACTTGCTGCATCTCGTAACCGC 
TCTTCTCGTAACGCAGCTTCATCTTCrGATGTGAAGACAAAAGCAGCAATCGTCACTGATACTGGTGCTGTTGATC 
ACAAATCATTCAACCAATCAGCTTGGGAAGGTTTGCAGGCTTGGGGTAAAGAACACAATCITTCAAAAGAT 
GTTTCACTTACTTCCAATCAACAAGTGAAGCTGACTACGCTAACAACTTGCAACAAG 
AATCTTCGGTGTTGGTTTTGCCCTTAATAATGCACTTAAAGATGCAGCAAAAGAACACACTGACTT 
TTGATTGATGATGTGATTAAAGACO-lAAAGAATGTTGCGAGCGTAACTTTCGCTGATAATGAGTC^^ 
CAGCTCTCGCTGCAGCAAAAACAACTAAGaCAAAACAAGTTGGTTTTGTAGGTGG 

CTCGTTrrGAAGCAGGATTCAAGGCTGGTGTTGCGTCAGTAGACCCATCTATCAAAGTCCAj\GTTGACTACGCTGC 
TTCATTTGCTGATGCGGCTAAAGGTAA AACA ATTGCAGCCGCACAATACGCAGCCGGTGCAGATATTGTTTACCA 
AGTAGCTGGTGGTACAGGTGCAGGTGTCTTTGCAGAGGCAAAATCTCTCAACGAAAGCCGTCCTGAAAATGAAAA 
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agtttgggttatcggtgttgatcctgaccaagaagcagaaggtaaatacacttctaaagatggcaaagaatcaaa 

C M ' i 0 1 1 C r 1 GT ATCTACTTTG A AAC AAGTTCGT AC AACTGT A A A AG ATATTTCT AAC A AGC CAG A AAG AG G AG AA 
TTCCCTGGCGGTCAAGTGATCGTTTACTCATTGAAGGATAAAGGGGTTGACTTCGCAGTAACAAACCTTTC^ 
AAGGTAAAAAACCTGTCGAAGATGCAAAAGCTAAAATCCTTGATGGAAGCCTAAAAGT 
D TCCTGAAAAATAA 

4193.3 

ATGTCTAAAAAATTACAACAAATTTCGGTTCCCTTGATTTCTGTATTCCT 

GTGGATCTTCGGTTATGATGCTATTTGGGGCTACGAAGAATTGTTCTATACAGCCnTTGGCAGTCT^ 
10 GGAGAAATCrrCCGTGCTATGGGTCCTCTGGTCTTGATTGGTCTTGGTTTTGCCG^ Vi ' i Cl ' l 

TAACGTCGGACTTCCTGGTCAGGCTTTCGCAGGTTCGATTCTCAGTGGTTGGTTTGCCCTGTCGCATCCA^ 
CCCCGTCCCTTGATGATTCTAGCAACCATCGTGATTGCCTTGATTGCTGGTGCGATTGTCGGAGCGATTCCAGGTA 
TT CTTAG GGCCTAT CTAG GGACGTCACAGGTTATTGTAACCATCATGATGAACTACATTGTCTTGTATGTA 
TGCCTTTATCCATGCTTTCCCTAAAGACTTCATGCAAAGTACAGATTCGACCATTCGTGTTG 
15 TATCAGA CACC TTGGTTGGCTGAGTTGACTGGTAACTC ACGGA TGAATATTGGTA rrinCI ' iU GCCATCATTGC^ 
TGCAGTTATTTGGTTCATGCTCAAGAAAACAj^CTCITGGTTTTGAAATCCGTGCAGTT 

cagaatatgctggtatttctgccaagcggactattatcctatctatgattatttcaggtgccttggcaggtc^ 

ggagctgttgaagctttgggaaccttccagaacgtctatgttcaaggttcgtcattagctatcggatttaacgg^ 

tggcggttagtttgcttgcggccaactcaccaattggtatactcrrrgcagccttcctatttggcgt^ 

20 GGGGCTCCTGGTATG AATGCGGCGCAGGTACCATCTG AGCTTGTCAGCATTGTAACAGCGTCTATTAT C 1 i CI ' li G 

TCAGTGTTCATTACCTTATCGAACGCTTTGTCAAACCGAAAAAj\CAAGTTAAAGGAGGTA^ 

4194.1 

ATGGGAGTGAJ^AAAGAAACTAAAGTTGACTAGTTTGCTAGGACTCTCTCTGTTAATCATGACAGCCTGTGCG 
25 AATG GGGTAA CTAGCGATATTACAGCCGAATCGGCTGA I 1 1 i I GGAGTAAATTGGTTTA Cl ILII I GCGGAAATCA 

TTCGCTTTTTATCGTTTGATATTAGTATCGGAGTGGGGA^ 

CCAGTCTTTCAGGTGCAAATGGTCGCTTCTAGGAAAATGCAGGAAGCTCAGCCACGCATTAAGGCGCTTCGAGA^ 
CAATATCCAGGTCGAGATA TGGA AAGCA GAAC CAAACTAGAGCAGGAAATGCGTAAAGTATTTAAAGAAATGGG 
TGTCAGACAGTCAGACTCTCTTTGGCCGATTTTGATTCAGATGCC 

30 GAGTTGAcr rrri AAAG ACAGGT CA it l Cl i atgg attaaccttggtactgtggatacaacccttgttcttccgatt 

TTAGCAGCAGTATTCACCTTTTTAAGTACTTCGTrGTCCAACA 

CGATG ATGTATGGG ATTCCAGTCTTG A I'i I II ATCTTTGCAGTTTATGCGCCAGGTGGAGTCGCCCTATACTGGAC 

AGTGTCTAATCCrTATCAAGTCTTGCAAACCTATTTCTTGAATAATCCATTCAAGATTATC^ 

GTAGTACAGGCACAAAAAGATTTGGAAAATAGAAAAj\GAAAj\GCCAj\CAAAAAGGCTCAGAA^ 

4194.4 

ATGGTTATCGATCCATTTGCTATCAACGAACTAGACTATTACTTAGTTTCACACTTCCACAGTGATCATATC^ 

catacacagctgcagcaattctcaataatcctaagttagagcatgttaagtttatcgctccttaccactgtggacg 
aatctgggaaggatggggtcttccaaaagaacgtatcatcgttgrraaaccaggtgacactatcgaattaaaaga 
40 tatgaagattcatgcagtagaatcatttgaccgtacttgcttggtaacrctcccagtgaacggtgctgatgagaca 
ggcggtgaacttgctggcttcgctgttacagatgaagaaatggctcaaaaggctgttaactatatc^ 

CAGGTGGAACCATCTATCATGGTGCaGATTCTCACTTCTCAAj\CTATT^ 

tgatcttgctttgaataactatggtgaaaatccggtaggtatccaagacaaaatgacatctatcgaccttcttcgt 
atggcagaaaatct gcgt accaaagtcattatcccagttcactatgatatctggtctaacrrcatggcttctact^ 
45 atgagattctagaactttggaaaatgcgaaaagatcgcttgcaatacgatttccatccatttatct 

cggtaagtacacttatcctcaagatcaacacttagtagaataccatcatccacgtggttttgatgattgttttgaa 

CAAGACTCTAACATTCAATTTAAAGCTTTGCTATAA 

50 

4196.2 

ATGT TCCTT TCAGGC TGCT TGTCTAGTTTTGCTAATACTTATATCCATGATTTACT^ 

TC CAl 1 t 1 l AAAT GCCTTTGAAAGTGCTATTGCGGCTCCrrTrGGTAGAAGAACCCTTGAAATTATTGTCAC^ 

TTGTTTTGGCTTTGATTCCTGTGCGAAAATTAAAATCTTTGTTTITACTTG 

ATGATTAAGGATATTGGTTATATTCGTACGGiVrTTGCCAGA 

GTATCATCTCAGGAATTCCCTCTCACTGGACTTTTTCAGGTCTAGCTGTAGTAGGTGTTTACTTGC^ 

TATAAAGGACAGAAGGTrGCCAAGAAACAGGGCCrTATTTITCTAGGTTTAGCC^ 

ACTCTCCTTTTGTCGAGTTGGAAACAGAGTTGCCITTAGCGATTC 

oU tatcatccttattgctttgttgagaaacacaatgagttgatgacctag 

4197.1 

ATGAAGGTGGAACCACGTTGCGACGTCCTTTCGAGGATGTCGCA i Nil I ' l ATTAGGATACTAATTATGGAGTTGC 

aagaattagtggagcgcagttcggcaatcccacaagcttatcacgaactggaagttaaccatcatgattccaact 
03 ggaccgtagaagaagacctcttggctttatctaatgatattggaaatttccaacgactggtgatgacaaagca 
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gacgctactatoatgaaacaccctacacactcgaacaaaaactttcacaaaatatctggtggctattagaac^^ 
ctcaacgtttggatatagacattctgacggaaatggaaaacttcctctctcataaacaaaagcaattgaacgtta 
ggacttggaagtag 

5 4197.4 

ATGCTTGATTGGAAACAAl 1 1 ITI CTAGCCTATCTGCGCTCCCGTACTCGTC 1 11 Ti ATCTATCTGCi 1 iCi i i GGC 
A lTlCM GTCTTACTCnTCA On i 1 I ATTTGCCAGTCTAGGAATTTACTTCCTCTA LI ' M I 1 C 1 1 L 1 i GTGTTGCTTT 
GTAACCATATTA i 1 1 1 1 CAC TTGGGAC ATATT GGTGGAAACGCAGGTCTATCGCCAGGAACTTCTCTATGGAGAGA 
GGGAAGCCAAGTCTCCTTTGGAAATAGCTTTAGCaGAAAAATTAGAAGCGCGTGAGATGGAACTCTATCAGCAG 
10 GGTCAAAAGCAGAAAGAAAACTGACGGATTTGCTGGATTACTATACCTTGTGGGTCCATCAGATAAAGA CCCC CA 
TTGCAGCCAGTCAACTCTTAGTTGCAGAAGTGGTCGACCGCCAACTGAAGCAGCAGCTAGAACAGGAAATTTTCA 
AAATCGACTCCTATACCAACCTACTTTTACAGTACCTGCGTTTAGAAAGTTTCCATGATGATT^ 
GGTTCAAATTGAGGACTTGGTCAAGGAAATAATTCGTAAATATGCTCTTTTCTTTAT^ 

AATCTaCATGACCTTGATAAAGAAATCGTGACGGATAAAAAGTGGCTGCTAGTGGTTATTGAGCAAATCATCTCA 
15 AACAGTCTCAAGTACACCAAGGAAGGTGGTCTGGAGATTTATATGGATGACCAAGAGCTTTGTATCAAAGATACG 
GGAATCGGGATAAAAAACAGTGATGTCCTCCGAGTATTTGAACCTCG CMUU I CAGGATACAATGGCCGTTTGACCC 

agcactccrcrggacttggccritatctatctaagaaaatttctgaagaactggggcacc^ 
tgaggtcggaaaaggaacgacagtgcggattcagtttgctcaactgaacttagtccttgagtaa 

20 4211.2 

atggaacttaatacacacaatgctgaaatcttgctcagtgcagctaataagtcccactatccgcaggatgaactg 
ccagagattgccctagcagggcgttcaaatgttggtaaatccagctttatcaacactatgttgaaccgta agaa tc 

TCGCCCGTACATCAGGAAAACCTGGTAAAACCCAGCTCCTGAA C T i ITl - 1 aacattgatgacaagatgcgctttgt 
GGATGTCCCTGCTTATGGCTATGCTCGTGTrrCTAAAAAGGAACGTGAAAAGTGGGGGTGCATGATTGAGGAGTA 
25 CnTAACGACTCGGGAAAATCTCCGTGCGGTTGTCAGTCTAGTTGACCTTCGTCATGACCCGTCAGCAGATGATGT 
CAGATGTACGAATTTCTCAAGTATTATGAGATTCCAGTCATCATTGTGGCGACCAAGGCGGACAAGATTC CTCGTG 
GTAAATCGAACAAGCATGAATCAGCAATCAAAAAGAAATTAAACTTTGACCCGAGTGACGATTTCATCCTCI'1 I ic 
ATCTGTCAGTAAGGCAGGGATGGATGAGGCTTGGGATGCAATCTTAGAAAAATTGTGA 

30 4211.3 

ATGACAAAGAAACAACTTCACTTGGTGATTGTGACAGGGATGAGTGGCGCAGGGAAAACTGTAGCCATTCAGTCC 
TTCG AGG ATCTAGGTTATTTCACCATTG ATAAT ATGCCGCCAGCTCTCTTGCCTAA G I ' l ' l ' 1 i GCAGCTGGTTG AAAT 
TAAGG AAG ACAATCCTAAGTTGGCCTTGGTAGTGG ATATCCGTAGCC G 1 ' i C 1 Tl C r ri ' l CAG AG ATTCAAGCTGTT 

ttgg atgagttgg aaaatc aagatgctttgg atttcaaaatcct c mm gg atgcggctgat aagg aattcgtcg 
35 ctcgttacaaggaaaccagacggagtcacccactagcagcagacggtcgtattttagatggaatcaagttggaac 
gtgaactcttggcacctttgaaaaatatgagccaaaatgtggtggatacgactgaactcactccacgtgagct gcg 
caaaacccttgcagagcagttttcagaccaagaacaagcccagtctttccgtatcgaagtcatgtctitcggat^ 

AAGTATGGAATCCCGATTGATGCGGACTTGGTCTTTGATGTCC Ull iLl i GCCAAATCCCTATTATTT ACCA GAACT 
GAGAAACCAAACCGGTGTGGATGAACCTGTTTATGATTATGTCATGAACCATCCTGAGTCAGAAGACTTTTATCAA 
40 CATTTATTGGCCTTGATTGaGCCGATTCTGCCAAGTTACCAAAAGGAAGGTAAGTCCGTTTTGACCATTGCCATGG 
. GATGTACGGGTGGACAACACCGTAGTCTGGCATTTGCTAAACGCTTGGCGCAGGACTTATCCAAGAATTGGTCTGT 
TAATGAAGGGCATCGCGACAAAGACCGCAGAAAGGAAACGCTAAACCGTTCATGA 

4211.4 

45 ATGAGAAAACCAA^GATAACGGTGATTGGTGGAGGGACTGGAAGTCCCGTCATTCTAAAAAGTCrGCGGGAAAAA 
GATGTGGAAATCGCAGCTATCGTGACGGTGGCAGATGATGGTGGTTCTTCAGGTGAACTCCGAJflu^AAAT^ 
CAGTTGACACCGCCAGGTGATCTTCGTAATGTCCTTGTGGCCATGTCGGATATGCCTAAG l 1 1 1 ATGAGAAGGTCT 
TTCAGTATCGGTTCTCTGAGGATGCCGGAGCCTTTGCTGGCCATCCATTGGGAAATCTCATCATTGCTGGCT^ 
AGAAATGCAGGGTTCAACCTATAATGCCATGCAGTTATTGAGCAAA ITi ' l I ccatacaacagggaaaatttatcct 

50 tccagtgaccatcctttgacccttcatgcagtctttcaggatgggacagaagtggctggagagagtcatattgtag 
accatcgaggcataattgacaatgtctatgtgaccaatgccctaaacgatgatacgcctctggccagccctcgag 
tagtgcagaccatccttgaaagtgacatgattgtcctagggccaggttccctctttacctctattttgcccaatat 
cgtgattaaggaaattgggcgggctcl i riggaaaccaaggcagaaattgcctatgtctgcaatatca tgacc ca 

ACGTGGGGAGACGGAACACTTTACAGATAGCGACCACCTGGAAGTCTTGCaTCGTCACCTTGGTCCCCCI I I lATC 
55 GACACTGTCnTGGTGAATATTGAAAAAGTGCCTCAGGAj^TACATCAATTCCAACCGTTTTGATGAATAC^ 
AAGTGGAACACGATTTTGTAGGTCTTTGTAAGCAAGTTTCGCGCGTGATTTCATCTAAC^ 
TGGCGGTGCCTTCCACGATGGAGATTTGATTCTGGACGAGTTGATGCGCATTATACAGGTGAAAAAATGA 

4213.1 

60 ATGAAAAATTTGATAAAGTTGCTAATAATTAGATTGATTGTTAACTTAGCAGACAGTGTA ri ri ATATAG^ 

TGTGGCAC GTTAGCA ATAATTATTCTTCGAGCATGrrCl-lAGGAATATrrATTGCAGTAAATTATCTACGGGATTTG 
TT ACT AATC 1' 1 ' n T i G G ACCAG TT ATTG AC AG AGT AAATCCGC AAAAA ATTCTT AT AAT ATCAAT I ' 1 ' i GGTTCAATT 
AGCAGTGGCTGTAATATTTITArrATTArrAAACCAAATATCATTTTGGGTGATAATGAGTCTAGTGTrrArrTCAG 
TAATGGCTAG CTCCAT AAGTTACGTGATAGAAGATCTGTrGATTCCTCAAGTGGTAGAATATGATAAGATTGTATT 

65 TGCAAATTCTCriTTTAGTATTTCGTATAAAGTATTAGATTCTAri rin AATTCATTCGCATCATTTT^ 
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CAGTAGGATTTATTTTATTGGTTAAGATAGATATAGGCATAl 1 I'l i ACTTGCTCTATTTATA i'l'GT rCTTCTTAAAA 
TTTAGAACTAGCAATGCGAATATAGAAAACTTCTCTTTCAAATATTACAAGAGAGAAGTC 

TTTAl l I lAAATAATAAATTATTATTTAAAACCAGTAl 1"IC1M"IAACGCTTATAAAC^ I ATTCATTTCAGACA 

GTAGTTGTACCGA ri i I 1 i CTATTCGATATTTTGATCGTCCGA l 1 11 n ATGGTA rri ITl - l AACTATTCCrCCTTTG 
5 GGTGGTATATTGGGAAATATGCTAGCGCCAATCCTAATAAAA 

TTTTTrrGAACGGCrCAAGTTGGTTAGTAGCAATTGTTATAj'WVAGACTAT^ 

GTTTTATGTCTAAAGGACTCTTCAATATTArri l i AATTCGTTGTACCAACAAATACCTCCACATCAACnCl IGGT 
AGGGTAAATACTACCATTGATTCTArrAri ILl i 1 1 GGAATGCCAATTGGTAGTrTAGTTGCAGGAACGCTTATTGA 
TTTGAATATTGAATTAGTGTTAATTGCTATTAGCATACCTTA rri - i ' i ' l G l i i ICi l ATA i i t 1 1 i ATACGGATAATGG 
1 0 ATTG AAAG AATTTAGTATATATTAG 

4213.2 

ATGATGTCTAA CAAA AATAAGGAAATTCTGAi 1 1 1 iGCGATTCTCTATACAGTCCTCTTTATO I 1 IGATGGCGTTAA 
ATTGCTGGCl'i L'l'Il AATGCCATCTGCCATTGCAAATTATCTTGTTTATGTA G I 1 11 AGCTCTATATGGCTC Cri LIT 
15 GTTCAAGGATAGATTGATCCAACAATGGAAGGAGATTAGAAj\GACTAAAAGAAAATTCTTCrr^ 
AGGATGGCTCTTTCTCATrCTGATGACTGTTGTCTTTG AATT rGTATCAGAGATGTTGAA 

ATGGACAAGGTCTAAATCAGTCTAATATTCAAAGTACLri'lCAAGAACAACCACTACTGATAGCTGl 1 I 1 IGCTTG 
TGTCATTGGACCTCTGGTAGAAGAATTA ml 1 CCGTCAGGTCTTATTGCATTACTTGCAGGAACGGTTGTCAGGTT 
TACTaAGCATTATTCTGGTAGGACTTG I 1 1 ! IGCTCTGACTCATATOCACAGTTTGGCTCTATCAGACTGGATTGGT 
20 GCAGTTGGTTACTTAGGTGGAGGCCTTGCCri I I CTATTaTTTATGTGAAAGAAAAAGAGAATATCTACTATCCCC 

TACTTGTTCACATGTTAAGCAACAGCCTCTCCrrAATCATTTTAGCTATCAGTATAGTAAAATGA 



4224.1 

ttgaa aaag ccaattatcgaattcaaaaacgtctctaaagtttttgaagacagcaacaccaaggttctc^ 

atcaactttgagttggaagaagggaaattctacacccttctaggtgcatctggttcggggaaatcaactatcct^ 

acattattgcaggtttactggatgcgacgacaggagatatcatgctagacggtgttcgtatcaatgatatrccaac 

30 caacaagcgcgacgtacataccgtcttccaatcctatgccttgttcccacatatgaatgtgtttgaaaatgttgcc 
tttccacttcgcttgcgtaaaattgataagaaagaaatcgagcagcgtgtagcggaagttctcaagatggttcact 
tggaaggttatgaaaaacgttccatccgcaaactttctggaggacaacctcagcgtgtggccatcgcccgtgcta 
tcatcaaccaaccccctgtggtcttgttcgacgagcctttatcagcgctggacttgaaattgagaacagacatgc^ 
gtacgaattgcgtgaa ttac aacaacgattgcgcattaccrrrgtctrrgtcacrcacgatcaggaagaagctc 

35 gccatgagtcactggattttcgttatgaatgatggcgagattgtccagtctggaacccctgtggacatctacgatg 
agccaatcaaccactttgttgccagctttatcggggagtcaaacatct^ 

ggtcgaatttaacggcaaacgctttgaagcggttgatggtgggatgaagccaaatgaacctgttgaggtcgttat 
tcgtccagaggacttgcgcattacccttcctgaagaaggcaagctccaagttaaggtcgatacccagcttttccgt 
' ggagttcattatgaaattatcgcctatgacgaacttggaaatgaatggatgatccactcaacccgtaaggctatcg 
40 tgggtgaggaaatcggtctggactttgaaccagaagacatccacatcatgcgtctcaatgaaaccgaagaagagt 
tcgatgctcgtattgacgagtacgtagaaatcgaagagcaagaagcaggtttgatcaatgcaatcgaggaggaaa 

G AG ATG AAG AAAAC AAGCTCT AA 
4252. 1 

45 ATGAAATCA ATGAG AATCTTATTTTTGTTAGCTTTAATTCAAATCAGTTTGAGTAGCTGT 

GCATCTT GTCC TTTAAACAAAGTACAGCll 1 I'l'l CATCGGAAGCATGGTTTTCGTTTCAGGAATCTGTGCTGGAGT 

aaattatctttatacccgtaagcaagaagtccatag tgtc ctagccagtaagaagtcggtgaag cri ' l ' i 1 1 acagt 
atgttactctta attaatt tgttaggagct gttctt gttttgtcagataacttgttcat 

AGAATTAGTTGA CTTT TTATTGCCATCC-1 ILl 1 1 i iCCTATTTGGGCTAGATTTGCTGA rri ' n fi ACCCTTG AAAAA 
50 ATACGTGCGCGAl 1 MCI 1 GCTATGCTGGACAGAAAAAAGACAGTGTTGGTGACTATTTTAGCAACA LI ICl 1 1 I C 

TTAAGAAATCCAATGACCA TTGTCTCACT TCTGATTTATATTGGACTGGGCTTGTTTT^ 
AAuATTCGGTTAAGAAGGAAGTTTCCTTTTATG^ 
TCTTTTAG 

55 4252,2 

ATGGTTAAAAAAATTATTGGAATGGTGCTAGCTTTACTTTCTGTAACTGTAGTAGGAGTAGGTGTT^^ 
TATTTATCAACAAGGCACaGAAACCTTACCTAAAACCTATAAAAAAATCGGTGAAGAAACCAAGGTTATTGAAGC 

gactgaacctctaaccattcrgttaatgggagtcgacaccggaaatgttgaacgaactgaaacttgggtcggtag 
aagtgatagcatgatcttgatgacagtgaatcctaaaacgaaaaaaacaacaatgatgagtttagagcgggatat 
oO tctgacgcgcattgaatcagggaatggtcaggctcatgaagcgaaactgaactcagcatatgcagatggtggagc 

AGAGCrTGCTATAGAAACCATTCAAAAAATGATGAATATCCATATTGATCGCTATGTGATGCTCAATATGAGAGG 

ATTGCAAAAACTAGTGGATGCAGTAGGAGGTATTACAGTCAATAATATCCTAGGTTTCCCAATTTCTATCAGTGAC 

CAAGAAGAATTTAATACTATTTCTATCGGTGTTGGGGaGCAACATATTGGGGGAGAAGAAGCCCTAGTCTATGCA 

cgaatgcgttaccaagatcctgagggggattatggtcgtcaaaaacgtcaacgtgaagttattcaaaaactcatc 

o5 GAAAAAGCTCTCAGTTTAAATAGCATTGCTCATTATCAAGAGATTCTAAAAGCTTTGAGTGACAj^T^ 
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AATATTGArrTGTCTGCAAAAACTATCCCrrA.A.CTTGCTAGGCTATAAAGATTCATrrAAA^ 

AGTTGCAGGGTGAAGGAGAGATACTTCAAGGTCTTTCTrACCAGATTGrTTCGAGAGCACATATGTTGGAAATGCA 
AAATCTACTCCGAC G ViC i I' I 'GGGACAAGAAGAAGTTACTCAGCTTGAAACCAATGCGGTTTTATTTGAAGATTTa 
TTTGGCAGAGCACCTGTTGCTGATGAAGATAATTAA 

5 

4256.2 

ATGAAAAAACAAGCCTATGTCATTATTGCTCTCACCTCCTTCCTATTTGT Ci 1 I I ' l ITi CTC CCACA GCTTGC TGGA 
AATACTTGATTTTGACTGGTCTAT Li 1 i 1 I GCACGATGTCGAAAAAACAGAAAAATTTGT L 1 L i 1 i ATTGTTGGTTT 
TCAGCATGTCCATGACCTGTCTCTTAGCCCTGTTTTGGCGAGGGATCGAAGAGCTTTCTCTAAGAAAAATGCAGGC 

10 TAATCTCAAGCGTTTATT AGCAG GGCAAGAAGTGGTTCAGGTTGCAGATCCAGATTTGGATGCCAGTTTCAAGTCC 
TTATCAGGTAAACTTAACC I i I I GACAGAGGCTCTTCAAAAAGCTGAAAATCAGAGCCTTGCTCAGGAAGAGGAA 
ATCATCGAGAAGGAACGGAAGCGAATTGCTCGCGATTTGCACGATACAGTCACTCAGGAGTTGTTTGCGGCCCAC 
ATGATTTTATCGGGTATCAGTCAGCAGCCTTTGAAATTGGATAGAGAAAAGATGCAGACCCAGTTGCAGAGTG 
ACAGCTATTTTAGAAACAGCCCAGAAGGATTTGCGGGTTTTGCTCTTGCATTTGCGACCAGTTGAACT^ 

15 AGAGCTTGATAGAAGGGATTCAAATT Cn 11 AAAAGAGCTTGAGGACAAGAGTGATCTTAGGGTTAGTCTCAAGC 

agaatatgacgaaattccctaagaaaatcgaggagcatatcttccgtatcctgcaagagttgattagcaataccc 
tccgccatgcccaggcatcttgcctagatgtctacctctatcagacagatgttgaattgcaactgaaggtggtgga 
caatgggattggtttccagttagggagcttagacgacttgagttatggactgcgaaatatcaaggagccggttga 
agatatggctggaacagttcaactcttgacacctcccaagcaaggcctggcggtrgatatccgtattcccctcrr^ 
20 gataaggaatga 

4263.1 

ATGATTGTrrCCATTATTTCTCAAGGATTTGTCTGGGCTATTCTAGGTCTGGGAATCTTTATGA 
AAACTTTCCAGATATGACGACAGAAGGTTCCTTCCCTCnTGGGGGAGCTGTTGCTGTCACTTrGATAACCAAAG 
25 GTGAACCCAl'lU"l"lAGCGACACriUl'rGCTGTAGGAGCAGCTTGTTTGGCTGGAATGGCAGCAGGCCi"lC-ri lATA 

CAAAAGGGAAGATCCCAACCTTGCTCTCAGGGAl 1 I IGGTGATGACi I CI ICTCACTCAATCATGCTCTTGATTAT 
GGGACCTGCGAATTrAGGCCTGCTTGGAACCAAGCAAATTCAGGATGTTTTGC Cl 1 11 GATTCGGATTTGAATCAA 

ctcttgacaggtctcatctttgtgagtattgttattgctctcatgct li n 1 ILl i ggacactaaactcggacaagc 
ctatattgctacaggggataatcctgatatgggtagaagtttcgggattcatactggacgcatggagctcatgggc 
30 ttggtcttatcaaatggtgtgattgcccttgcaggtgccctcattgctcagcaagaaggttatgccgatgtgtctc 
gagggatcggggttatcgttgtggggcttgcaagtttgattattggagaagttattttcaagagtttgag 
agagcgtttggttac tatc gttgtaggtrctatcgcttatcaa l - ri ri agtgtgggcagttatcgcacttgcc^ 
atacaagttaccttcg IT! atacagtgccttga rrii agcagtctgcctcatgattccaacatttaagcaaacaat 

CTTGAAAGGAGCCAAGTTAAGCAAATGA 

35 

4346.1 

ATGAAAAAAATGAAAGTTTGGTCTACTGTACTTGCAACGGGAGTTGCTCTTAC^ 

gttcaaattctacgactgl ri C 11 catctgaagaaaaagctgataaaagtcaagaattagttatctattcgaactc 
agtctcaaatggtcgtggtgattggttaactgctaaagcaaaagaagctggttttaatataaaaatggtt 
40 gctggcgctcaattagcagaccgtgttattgctgagaagaataatgcagttgcagatatggtatttggaattggtg 
ctcttgattcaaataaaattagagatcaaaaattactagtacagtacaagcctaaatggttagataaaattgatc 
aatctttatcagataaagataattattataatcctgtgattgttcaaccattagriu^aattggggcgcctgatgta 
aaagaaatgcctaaagattggactgaattaggtagtaagtataaaggtaaatattcaatttctggtcttcaatc 
ggtacaggacgggcaattctagcaagtatcttagttcgataccttgatgataaaggtgaattaggtgtttccgaaa 

45 AAGGTTGGGAAGTAGCAAAAGAATATTTGAAj\AATGCATACaCTCTTCAAAAGGGAGAAj\GT^ 

TGTTAG ACAA AGAAGATCCAATACAATATGGAATGATGTGGGGTTCTGGTGCATTAGTTGGACAAAAAGAACAAA 
ATGTTGTTTTCAAAGTTATGACTCCTGAGATTGGTGTACCATTTGTAACTGAj\CAAACTATG 
TAAAAAACAj^GCGTTAGCTAAAGAATTTATTGATTGGTTrGGTCAATCAGAAATTCAAGTAG 
CTTTGGATCTATTCCTGCAAATAAAGATGCCCTCAAAGATCTACCTGAAGATACGAAGAAATTTGT^^ 

50 AAACCACAAAATATTGACTGGGAAGCTGTTGGAAAGCATTTGGATGAATGGGTAGAAAAAGCTGAATTAGAATA^ 
GTACAATAA 

4346.2 

ATGATTAAATTTGATAATATTCAAATTAAATATGGTGAl 1 l lOl IGCAATTGATAATCTGAATTTAGATATACATG 
55 AAGGGGAATTTTTTACA 1 U Ll I GGGCCTTCAGGATGTGGTAAATCAACTACTTTGAGAGCATTGGTAG Gi 1 11 CT 

agatccatcatcaggaagtattgaagttaatggaacagatgtcactcatttggaacctgaaaagcgtcgaattgg 

T ATTGTATTTC AATCTT ATGCGCTATTTCC AACTATG ACTO riT i " 1'GATAATATTGCATTTGGTTTAAAAGTTAAG A 
AGCTAGCTCCAGATGTTATTAAAGCTAAAGTATCAGCAGTGGCAGCAAAAATTAAGATCTCTGATCAACAGTTAC 
AGCG TAAT GTATCAGAATTATCTGGGGGTCAACAACAAAGGGTAGCATTGGCTCGTGCTCTGGTTCTTGAACCTAA 
60 " AATTCTTTGTCTAGATGAACCATTGTCAAACCTTGACGCAAAATTACGTGTAGATTTGaGAAAAGAGT^ 

ctrcaaa aaga gttaggtattactacrrtatatgttactcatgatcaagaggaagccttgacttt 
ttgcagtct ttaa caatggatacatcgaacaggtcggtacaccagtagagatttatcataattctcaaactgaatt 
tgtatgtgattttattggagatattaatgtttrgaccgatgaaacagtccacgaagtattattgaaaaatac^ 
Gl n ILl iagaggataaaaaaggataca ttcga ttagagaaagttcgattcaatcgtgaaactgaacaagatttta 
65 ttctaaaagggacaattattgatgttgagi 1 1 ictggagttacaattcactatacaataaaagtttctgaaagtca 
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GAl ILl iAATCTAACAAGTATTGATAGTCAGGCTGCTATTAGATCTCTCCGAGA.\ACTGTCGAATTATTTATCACA 
CCATCAGACCTTCTGCAATTTTAA 

4346.3 

5 ATGCGTCATAAATTAAATTTAAAAG ATTGGCTTATTCGTTTAGGGTTAATCTG GT I C T! AGTAACATTTATTATTTA 

tccaaactttgatctagtagtgaatgtatttgtaaaaggagcagaattttcccttgatg 

aaatctcagatcgcacttcagagtattatgaacagttttaagttagcattttcactcatt 

ta^gtattctttgtgttctatttacagactactttgat^ 

TCT TTAATTTATGGAGGAG TGG 1 1 1 I AGCGACTGGATATAAArrtU'lCTATGGTCCTTATGGATTGATTACAAAATT 

10 TTT ACAAAATGTT ATCCL U C l" 1" 1 AG ACCCTAACTGG IT l ATTGGGTATGGTGCAGTCTTATTCATTATGACATTTT 

CAGGAACTGCTAATCATACATTGTTTTTAACAAATACAATTCGAAGCCrrGACTATCACACTATT^ 
AAATATGGG AGCAAA ACCATT TACT U i I Tl CCGAAAAGTACTGrrACCAACCTTAATTCCAACTCTATTTGCACTT 
ACTATT ATGG 11 ' 1 T 1 CT 1' A GTGG 11 I ATCTGC ACTAGCAGCACCC ATG ATTGTTGGTGGTAAAG AATTTCAAACT AT 
AAATCCAATGATT ATTA CATTTGCAGGGATGGGGAATTCTCGTGATTTAGCTGCCCTACTTGCAATTATTTTAG 

15 ATTGCAACTACAATTTTGCTTACTATCATGAATAAGATAGAAAAAGGTGGAAATTATATTTCTATCT 

AAGCGCCTCTTAAj\AAACAAAAAATTGCGTCTAAGCCTTGGAATATCATTGCTCACATTGTAGCATATGGAT^ 
CACAGTTTTCATG CTTC C ACTA ATTTTTATAGTATTATACTCATTTACAGATCCAGTTGCAA 
TA ACAT TATCAAACTTTACTTTAGAAAATTATCGCITATTCTTTAGTA^ 
AGCTTrATTT^^TTCTATTATTGCT 

20 TAAATCTCGTTTTGiVrTTCTTATTTGAA 

GTTTATTATTTACTTTTAATCAGCCACAA l I ' lCl I GTCTTGAATCAGA I rn 'GGTAGGTAGTTTGGTAATTCTACTT 
ATTGCATATATAGTTGTAAAAATCCCAriMU'CrrATAGAATGGTACGTGCTATTTTATTTACTGTTGATGATGAGAT 
GGAAGATGCAGCAAGAA GTATG GG TGCTT CAC Li i 1 I 1 ATACTATGATGAAGGTTATCATTCCATTTATTTTACCG 
GTTGTTCTCTCTGTTATTGCTTTAAACTTTAACTCITTATTA^ 

25 CTAGCrpAACCATTAGGTATTACGATTCGATCTGCAGGTGATGAAACAGCAACATCTAATGCACAAGCTCTGGTAT 
TTGTTTATACAATTGTrCTGATGATTATTTCTGCAACGGTATTATACTTCACACAAAGACCGGGGCGTAAAGTA^ 
GAAATAA 
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Table 2 

meelvtldclfidrtkieanankysfvwkkttekfsakloeqiqvyfqe£^^plukyamfdkkqkrgykesaknla^w 
hyndkedsythpdgwyyrfhhtkyqktqtdfqqeikvyyadepesapqkglymneryqnlkakecqallspqgrqif 
aqrkidvepvfgqikaslgykrcnlrgkrqvr1dmglvlmannllkyskmk2 

mgkghwnrkrvysirkfavgacsvmigtcavlxggniagesvvyadetlrn-rtaekpkeekmlveekadkaletkniv 
erteqsepssteaiasekkedeano-pkeekvsakpeekapriesqasnqekpucedakavtneevnqmiedrkvdfnqn 
wyfklnansicealkpdadvstwkkldlpydwsifndfdhespaqneggqlnggeawyrktfkldekdlkkkvrltf 
dgvymdsqvyvngqlvghypngynqfsydmcylqkdgrenviavhavnkqpssrwysgsgiyrdvtlqvtdkvhv 
ekngttiltpkl££qqhgkvethvtskrvrnt>dkdhelvaeyqrvergghavtglvrtasrtucahestsldallever 
/ pklwtvlndkpalyelitrvyrdgqlvdakkdlfgyryyhwtpnegfslngerikfhgvslhhdhgalgaeenykae 
yrrlkqmkemgvnsirtthnpaseqtlqiaaelgllvqeeafdtwyggkkpydygrffekdathpearkgekwsdfd 
lrtmvergknnpaifmwsigneigeangdahslatvkrlvkvikdvdktryvtmgadkfrfgngsgghekiadelda 
15 vgfnysednykalrakhpkwuygsetssatrtrgsyyrperelkhsngpernyeqsdygndrvgwgktataswtfd 
rdnagyagq^wtgtdyigeptpwhnqnqtpvkssyfg^vdtagipkhdfy^yqsqwvsv^ckkpmvhllpkw^wenk 
elaskvadsegkipvraysnassvelflngkslglctfnkkqtsdgrtyqegananelylewkvayqpgtleaiardes 
gkeiardkittagkpaavrlikedhaiaadgkdltyiyyeivdsqgnvvptannlvrfqlhgqgqlvgvdngeqasrer 
ykaqadgswirkafngkgvaivksteqagkftltahsdllksnqvtVftgkkegqektvlgtevpkvqtiigeapempt 
20 tvpfvysdgsraerpvtwssvdvskpgivtvkgmadgrevearveviaucselpvvkriapntdlnsvdksvsyvlidgs 
veeyevdkweiaeedkaklaipgsriqatgylegqpihatlvveegnpaapavptvtvggeavtgltsqkpmoyrtla 
ygaklpevtasaknaanttvloasaangmrasirqpkdggplqtyaiqfleeapiciahlslqvekadslkedqtvklsv 
rahyqdgtqavlpadkvtfstsgegevairkgmlelhkpgavtlnaeyegakdqveltiqantekxiaqsirpvnvvt 
dlhqepslpatvtveydkgfpkthkvtwqaipkekldsyqtfevlgkvegidlearakvsvegivsveevsvttpiaeap 
25 qlpesvrtydsnghvssakvawdairpeqyakegvftvngrlegtqlttklhvrvsaqteqganisdqwtgselplaf 
asdsnpsdpvsnvndklisynnqpanrwtnwnrtnpeasvgvlfgdsgilskrsvdnlsvgfhedhgvgvpksyvtey 
yvgktvptapknpsfvgnedhvfndsanwkpvtnlicapaqlkagemnhfsfdkvetyavrirmvkadnkrgtsitev 
qifakqvaaakqgqtriqvdgkdlanfnpdltdyylesvdgkvpavtasvsnnglatvvpsvregepvrviakaengd 
ilgeyrlhftkdicsllshkpvaavkqarllqvgqalelptkvpvyftgkdgyetkdltveweevpaenltkagqftvr 
30 grvlgsnlvaem/rvtdklgetlsdnpnydensnqafasatndidknshdrvdylndgdhsenrrwtnwsptpssnp 
evsagvifrengkivertvtqgkvqffadsgtdapsklvleryvgpefevptyysnyqaydadhpfnnpenweavpyr 

ADKDIAAGDEINVTFKAIKAKAMRWRMERKADKSGVAMIEMTFLAPSELPQESTQSKILVDGKELADFAENRQDYQIT 

ykgqrpkvsveennqvastvvdsgedsfpvlvrlvsesgkqvkeyrihltkekpvsefctvaavqedlpkiefvekdlay 
ktvekkdstlylgetrveqegkvgkeriftalnpdgskeeklrevvevptdrivlvgtkpvaqeakkpqvsekadtkpro 
35 sseasqtnkaqlpstgsaasqaavaagltllglsaglvvtkgkked2 

mkimkkkywtlailffclfnnsvtaqeipknldgnrrhtqtsesfsesdekqvdysnknqeevdqnkfriqidktelfvt 
tdkhleknccklelepqinndtvnsesnnllgednldnkikenvshldnrcgniehdkdnlessivrkyewdidkvtgg 
gesyklysksnskvsiaildsgvdlqntgllknlsnhsknyvpnkoylgkeegeegiisdiqdrlghgtavvaqivcddn 

40 INGVNPHVNINVYRIFGKSSASPDWIVKAIFDAVDDGNDIINLSTGQYLMIDGEYEDGTNDFETFLKYFCKAIDYANQKGV 
irVAALGNDSLNVSNQSDLLKLISSRKKVRKPGLVVDVPSYFSSTlSVGGIDRLGNLSDFSNKGDSDAlYAPAGSTLSLSEL 
GLNNFINAEKYKEDWrFSATLGGYTYLYGNSFAAPKVSGAlAMIIDKYKLKDQPYNYMFVKKFWKKHYQZ 

MICKTWKVFLTLVTALVAVVLVACGQGTASKDNKEAELKKVDFILDWTPNTNHTGLYVAKEKGYFKEAGVDVDLKLP 
45 PEESSSDLVINGKAPFAVYFQDYMAiCKLEKGAGrrAVAAfVEHNTSGIlSRKSDNVSSPKDLVGKKYCTWNDPTELAML 
KTLVESOGGDFEKVEKVPNNDSNSrrPlANGVFDTAWTYYGWDGILAKSQGVDANFMYLICDYVKEFDYYSPVIIANND 
YLKDNKEEARXVIQAfKKGYQYAMEHPEEAADILIKNAPELKEKRDFVIESQKYLSKEYASDKEKWGQFDAARWNAFY 
KWDICENGILKEDLTDKGFTNEFVKZ 

50 MKRTWRNSFVTNL^^*PFMIGNIEIPNRTVl^PMACVTNSAFRTIAKELGAGLVVMEMVSDKGIQYNNEKTLHMLHlDE 
GENPVSIQLFGSDEDSLARAAEFIQErrrKTDIVDINMGCPVNKIVKNEAGAiMWLKDPDKIYSUNKVQSVLDlPLTVSCMR 
TGWADPSLAVENALAAEAAGVSALAMHGRTREQMYTGHADLETLYKVAQALTKIPFIANGDIRTVQEAKQRJEEVGA 
DAVMIGRAAMCNPYLFNQINHYFETGEILPDLTFEDKMKIAYEHLKRLINLICGENVAVREFRGLAPHYLRGTSGAAICL 
RGAISQASTLAEIETLLQLEKA2 

MtKNPKLLTKSFLRSFAILGGVGLVlHIAIYLTFPFYYIQLEGEKFNESARVFTEYLfCTKTSDEIPSLLQSYSKSLTISAHLK 
RDIVDKRLPLVHDLDIKDGKL^NYtVMLDMSVSTADGKQVTVQFVHGVDVYKEAtCNILLLYLPYTFLVTlAFSFVFSYF 
YTKRLLNPLFYISEVTSKMQDLDDNIRFDESRKDEVGEVGKQINGMYEHLLKVIYELESRNEQIVKLQNOKVSFVRGAS 
HELKTPLASLRJILENMQHNIGDYKDHPKYIAKSINKIDQMSHLLEEVLESSKFQEWTECRETTVKPVLVDILSRYQELAH 
OO SIGVTIENQLTDATRVVMSLRALDKVLTNLISNAIKYSD(CNGRVIISEQDGYLSIKNTCAPLSDQEL.EHLFDIFYHSQIVTD 
KDESSGLGLYIVNNILESYQMDYSFLPYEHGMEFKISL2 

MYLGDLMEKAECGQFSILSFLLQESQTTVKj^VMEETGFSiCATLTKYVTLLNDICALDSGLELAlHSEDENLRLSIGAATK 
GRDIRSLFLESAVKYQILVYLLYHQQFLAHQLAQELVISEATLGRHLAGLNQILSEFDLSIQNGRWRGPEHQIHYFYFCL 
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FRKVWSSQEWEGHMQKPERKQEIANLEEICGASLSAGQKLDLVLWAHISQQRLRVNACQFQVIEEKMRGYFDNIFVI-R 
LLRKVPSFFAGQHIPLGVEDGEMMIFFSFLLSHRILPLHTMEYILGFGGQLADLLTQLIQEMKXEELLGDYTEDHVTYEL 
SQLCAQVYLYKGYILQDRYKYQLENRHPVLLMEHDFKETAEEIFHALPAFQQCTDLDFCKILWEWLQLrEYMAENCGQ 
HMRIGLDLTSGFLVFSRMAAIUCRYl^YNRFmEAYDPSRHYDLLVTNNPIHKKEQTPVYYLKNDLDMEDLVAlRQLLF 
5 T2 

MEFSKKTRELSIKKMQERTLDLLIIGGGrrGAGVALQAAASGLETGLIEMQDFAEGTSSRSTKLVHGGLRYLKOFDVEV 
VSDTVSERAVVQQL\PHIPKSDPMLLPVYDEDGATFSLFRlJCVAMDLYDLLAGVS^^PAA^^CVLSKDQVLERQP^a^ 
EGLVGGGVYLDFRNNDARLVIENIKRANQDGALlANHVKAEGFLFDESGKrrGWARDLLTDQVFEIKARLVIhrrTGPW 
10 SDKVR^^^^^CGTQFSQMRPTKGVHLVVDSSKIKVSQPVYFDTGLGDGRMVFVLPRENKTYFGTTDTDYTGDLEH^ 

QEDVDYLLGrVNNRFPESNrriDDfESSWAGLRPUAGNSASDYNGGNNGTISDESFDNLlATVESYLSKEKTREDVESAV 
SKLESSTSEKHLDPSAVSRGSSLDRDDNGLLTLAGGKrrDYRKMAEGAMERVVDUJCAEFDRSFKLINSKTYPVSGGELN 
PAKVDSEIEAFAQLGVSRGLDSKEAHYLANLYGSNAPKVFALAHSLEQAPGLSLADTLSLHYAMRNELTLSPVDFLLRR 
TNHMLFMRDSLDSIVEPILDEMGRFYDWTEEEKATYRADVEAALANNDLAEUCN2 

15 

MMNELFGEFLGTLIULLGNGVVAGVVLPKTKSNSSGWIVITMGWGIAVAVAVFVSGKLSPAYLNPAVTIGVALKGGLP 

WASVLPYILAQFAGAMLGQILVWLQFKPHYEAEENAGNILATFSTGPAIKDTVSNLISEILGTFVLVLTIFALGLYDFQA 

GIGTFAVGTLIVGIGLSLGGTTGYALNPARDLGPRIMHSILPIPNKGDGDWSYAWIPVVGPVIGAALAVLVFSLFZ 

20 NtTKKKTERISVTHREKn-WLKWYFMRDKEQPKYSVLERKMFDAAKNQDMLAYQKYATrKQrro 
VYVYNHMNVIGACQRILFISQSPAYDKLNKWFNIYSDLYFSVVPLPKMGVYHEMVGIZ 

MKNSNEAEMKLLYTDIRTSLTEILTREAEELVAAGKRVFYlAPNSLSFEKERAVLEYLSQQASFSrrVTRFAQMARYLVL 
NDLPAKTTUDDIGLGLAFYKCLAELDPKDLRVYGAIKQDPQLIQQLlELYHENrrKSQMSFLDLENLTDEDKRADLLLIF 

25 EKVTAYLNQGQLAQESQLSHLIEAlENDKVSSDFNQIALVlDGFTRFSAEEERVVDLLHGKGVErviGAYASKKAYTSPFS 
EGNLYQASVKFLHHLASKYQTPAQDCSQTHEKMDSFDKASRLLESSYDFSELALDVDEKDRENLQIWSCLTQKEELEL 
VARSIRQKLHENSDLSYKHFRILLGDVASYQLSLKTIFDQYQIPFYLGRSEAMAHHPLTQFVESILAUCRYRFRQEDUNL 
LRTDLYTDLSQSDIDAFEQYIRYLGINGLPAFQQTFTKSHHGKFNLERLNVLRLRILAPLETLFASRKQfCAEKLLQKWSV 
FLKEGAVTKQLQDLTTTLEAVEQERQAEVWKAFCHVLEQFATVFAGSQVSLEDFIJVLLHSGMSLSQYRTIPATVDTVL 

30 VQSYDLIAPLTADFVYAIGLTQDNLPKISQ^^TSLLTDEERQNLNQATEEGVQLLIASSENLKKNRYTMLSLVNSARKQLF 
LSAPSLFNESESKESAYLQELIHFGFRRREKRMNHKGLSKEDMGSYHSLLSSLVAYHQQGEMSDTEQDLTFVKVLSRVI 
GIOCLDQQGLENPAIPTSPSSKTLAKDTLQALYPAKQEFYI^TSGLTEFYRNEYSYFLRYVLGLQEEUU-HPDARSHGKFL 
HRIFERALQLPNEDSFDQRLEQAIQETSQEREFEAIYQESLEAQFTKEVLLDVARTTGHILRHNPAIETIKEEANFGGKDQ 
AFIQLDNGRSVFVRGKVDRIDRLKANGAIGVVDYFCSSLTQFQFPHFFNGLNSQLPTYLAALKREGEQNFFGAMYLEMA 

35 EPVQSLMAVKSLAGAVVEASKSMKYQGLFLEKESSYLGEFYNKNKANQLTDEEFQLLLDYNAYLYKKAAEKILAGRF 
AiNPYTENGRSlAPYVQQHQArrGFEANYHLGQARFLEKLDLADGKRLVGEKLKQAWLEKIREELNRZ 



MKLIPFLSEEEIQKLQEAEANSSKEQtCICTAEQIEAlYTSAQNILVSASAGSGKTFVMAERILDQLARGVEISQLFISTFTVK 
40 AATELKERLEKKISKKIQETDDVDLXQHLGRQLADLPNAAIGTMDSFTQKFLGKHGYLLDIAPNFRILQNQSEQLILENE 
VFHEVFEAHYQGKQKETFSHLLKNFAGRGKDERGLRQQVYKIYDFLQSTSNPQKWLSESFLKGFEKADFTSEKEICLTE 
QIKQALWDLESFFRYHLDNDAKEFAKAAYLENVQLILDEIGSLNQESDSQAYQAVLARVVAISKEKNGRALTNASRKA 
DLKPLADAYNEERKTQFAKLGQLSDQlAlLDYQERYHCDTWKLAiCTFQSFMSDFVEAYRQRKRQENAFEFADlSHYTrE 
ILENFPQVRESYQERFHEVMVDEYQDTNHIQERMLELLSNGHNRFMVGDIKQSIYRFRQADPQIFNEKFQRYAQNPQEG 
45 RJLULKENFRSSSEVLSATNDVFERLMDQEVGEINYDNKHQLVFANTKLTPNPDNKAAFLLYDKDDTGEEEESQTETKL 
TGEMRLVIKEILKLHQEKGVAFKEIALLTSSRSRNDQILLALSEYGIPVKTDGEQNNYLQSLEVQVMLDTLRVIHNPLQD 
YALVALMKSPMFGFDEDELARI^LQKAEDKVHENLYEKLVNAQKMASSQKGLIHTALAEKUCQFMDILASWRLYAICT 
HSLYDLIWKIYNDRFYYDYVGALPNGPARQANLYALALRADQFEKSNFKGLSRFIRMIDQVLEAQHDLASVAVAPPKD 
AVELMTIHKSKGLEFPYVFILNMDQDFNKQDSMSEVILSRQNGLGVKYIAKMETGAVEDHYPKTIKLSIPSLTYRQNEEE 
50 LOLASYSEQMRLLYVAMTRAEKKLYLVGKGSREKLESKEYPAAKNGKLNSNTRLQARNFQDWLWAiSKVFTKDKLNF 
SYRFIGEDQLTREAIGELETKSPLQDSSQADNRQSDTIKEALEMLKEVEVYNTLHRAAIELPSVQTPSQDCKFYEPVMDM 
EGVEIAGQGQSVGFCKISFDLPDFSTKEKVTGAEIGSATHELMQRIDI^QQLTLASLTETUCQVQTSQAVRDKINLDKILAF 
FDTVLGQEILANTDHLYREQPFSMLKRDQKSQEDFVVRGILDGYLLYENKIVLFDYKTDRYDEPSQLVDRYRGQLALY 
EEALSRAYSIENIEKYLILLGKDEVQVVKV2 

MELARHAESLGVDAIATIPPIYFRLPEYSVAKYWNDISSAAPNTDYVIYNIPQLAGVALTPSLYTEMLKNPRVIGVKNSS 
MPVQDIQTFVSLGGEDHIVFNGPDEQFLGGRLMGARAGIGGTYGAMPELFLKLNQLIADKDLETARELQYAINAIICKL 
TSAHGNMYGV1KEVLKINEGLNIGSVRSPLTPVTEEDRPVVEAAAALIR£TKERFL2 

60 

MYKTKCLREKLVLFLKIFFPILIYQFANYSASFVDTAMTGQYNTMDLAGVSMATSIWNPFFTFLTGIVSALVPIIGHHLG 
RGKKEEVASDFYQFIYLALGLSVVLLGMVLFLAPHLNHIGLEAAVAAVAVRYLWFLSIGIIPLLLFSVIRSLLDSLCLTKL 
SMYLMLLLLPLNSGFNYLLIYGAFGVPELGGAGAGLGTSLAYWVLLGISVLVLFKQEKLKALHLEKRIPLNMDKrKEGV 
RLGLPIGGTVFAEVAIFSVVCLIMAKFSPUIASHQSAMNFSSUMYAFPMSISSAMAIVVSYEVGAKRFDDAKTYIGLCRW 
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TALIFAAFTLTFLYIFRGNVASLYGNDPKFU>LTVRFLTYSLFFQLADTFAAPLQGILRGYKDTVIPFYLGLLGYWGVAJP 
VYAI2 

MSTlj\iaEAlXFVAGEDGIRVRQLAEIXSLPin-GIQQSLGICLAQKYEKDPDSSL^^ 
5 PINQSL5RAALETLSIIAYKQPrTRIEmAmGVNSSGAUSXLQAFDLnCEDGFaCEVLGRPNLY\nTDYFLDYMGINHl^ 
PVIDELEIQAQESQLFGERIEEDENQ2 

MDTMISRiTRHLFEAUCSUCRNGWMTVAAVSSVMrrLTLVAIFASVlFOTAKLATDIENNVRWV^ 
KEGQTVTNNDYHKVYDSLKNMSTVKSVTFSSKEEQYEKLTElMGDNWKIFEGDANPLYDAYIVEAhrrPNDVKTU^ 
10 KKIEGVSEVQDGGANTTERLFKLASnRVWGLGlAALUFlAVFLISmriRmiSRSREIQIMRLVGAKNSYIRGPFLLEGAFI^ 
LLGAlAPSVLVFIVYQrVYQSVhnCSLVGQNLSMlSPDLFSPLMlALLFVIGVFIGSLGSCISMRRFUCIZ 

MKKVRFIFLALLFFLASPEGAMASDGTWQGKQYLJCETCSOAANEWVFDTHYQSWFYIKADANYAENEW 
YUCSGGYMAKSEWVEDKGAFYYLDQDGKMKRNAWVGTSYVGATGAXVIEDWVYDSQYDAWFYIKATC 

15 LQIKGKDYYFKSCGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYTKENGNYADKEWIFENGHYYYLKSGGY 
MAANEW^WDKESWFY1JCF^X;KMAEKEWVYDSHSQAWYYFKSGGY^T^AJ^^EWIWDKESWFYLKSTC 
SHSQAWYYFKSGGYNH'ANEWIWDKESWFYUCSDGKIAEKEWVYDSHSQAWYYFKSGGYMAKNETVDGYQLGSDGK 
WLGGKTTNENAAYYQVVPVTANVYDSDGEKLSYISQGSVVWLDKDRKSDDICRLAmSGLSGYMKTEDLQALDASKD 
nPYYESDGHRFYHYVAQNASIPVASHLSDMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAEELDKVFSLLNI 

20 ^WSLLE^nCGATFKEAEEHYH^NALYLLAHSALES^WGRSKlAKDKNNVFG^^AyDTTPYLSAIa•FDDVDKGILGATKW 
KENY1DRGRTFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKD2 

MKKVLQKYWAWAFWIPLLLQAIFFYVPMFQGAFYSFrrrWTGLTYhnTCFVGLNNFICLLFM 

VVGEIALGIFIARVLNSKIKGQTFFRAWFFFPAVLSGLTVAUFKQVFNYGLPAICNALHIEFFQTSLLGTKWGAIFAAVF 
25 VLLWQGVAMPIIIFLAGLQSIPTErrEAARIDGATSKQVFWNIELPYLLPSVSMVFfLALKGGLTAFDQVFAMTGGGPNN 
ATTSLGLfLVYNYAFKNNQFGYANAUVILFFLIVVISlIQLRVSICKFEIZ 

MMKQDERJCAUGKYlLLILGSVULVPLLATU^SSFKPTKDIVDNFFGFPTNFTWDNFSRLLADGIGGYYWNSVVrrVl^^ 
LAVMinPMAAYSlARNMSKRKAFriMYTLLILCIFVPFQVIMIPrrVMMSKLGLAhrrFGLILLYLTYAIPQTLFL^ 
30 SIPESLDEAAEIDGANQFTTYFRIIFPMMKPMHATTMIINALWFWNDFMLPLLVLNRDSKMWTLPLFQYNYAGQYFND 
YGPSFASYVVGUSrrrVYLFFQRHIISGMSNGAVICZ 

MKSlLQKMGEHPMLLLFLSYSTVISrUVQhWMGLVASVGMFLFTIFFLHYQSILSHKFFRLILQFVLFGSVLS 
FQIVKKFNYAFLSPNMQVWHQNRAEVTFFNPNYYGIICCFCIMIAFYLFTTTKLNWLKVFCVIAGFVNLFGLNFTQNRT 
35 AFPAIIAGAIIYLFTTIKNWKAFWLSIGVFAIGLSFLFSSDLGVRMGTLDSSMEERISIWDAGMALFKQNPFWGEGPLTYM 
NSYPRIHAPYHEHAHSLYIDTlLSYGIVGTILLVI^SVAPVRLMMDMSQESGKRPnCLYLSFLTVVAVHGlFDLALFWIQS 
GFIFLLVMCSIPLEHRMLVSDMTD2 

MSKMDVQKIIAPMMKFVNMRGIIAUCTCMLAILPLTVVGSLFLIMGQLPFEGLNKSIASVFGANWTEPFMQVYSGTFAI 
40 MGLISCFSIAYSYAKNSGVEALPAGVLSVSAFFILLRSSYIPKQGEAIGDAISKVWFGGQGIIGAIIIGLVVGSIYTFFIKRKIV 
rKMPEQVPQAL\KQFEAMIPAFVlFL5SMrVYILAKSLTNGGTFlEMIYSAlQVPLQGLTGSLYGAlGlAFFISFLWWFGVH 
GQSVVNGVVTALLLShfLDANKAMLASANLSLENGAHIVTQQFLDSFLILSGSGrTFGLVVAMLFAAKSKQYQALGKVA 
AFPAlFNVNEPVVFGFPrVMNPVMFVPFILVPVLAAVIVYGAlATGFMQPFSGVTLPWSTPAlLSGFLVGGWQGVtTQLVI 
LAMSTLVYFPFFKVQDRLAYQNEIKQS2 

MKKKDLVDQLVSErETGKVRTLGIYGHGASGKSTFAQELYQALDSTTVNLLETDPYITSGRHLVVPKDAPNQKVTASLP 
VAHELESLQRDILACRRVWMSZ 

MKXRYLVLTALLALSLAACSQEKTK^mDGETKTEQTAKADGTVGSKSQGAAQKKAEVV^OCCDYYSIQGKYDEI^ 
50 KHYPLSKDYNPGENPTAKAEL.VKLIKAMQEAGFPISDHYSGFRSYETQTFCLYQDYVNQDGKAAADRYSARPGYSEHQT 
GLAFDVIGTDGDLVTEEKAAQWLLDHAADYGFVVRYLKGKEKETGYMAEEWHLRYVGKEAKEIAASGLSLEEYYGF 
EGGDYVD2 

MREPDFLNHFLKKGYFKKHAKAVLALSGGLDSMFLFKVLSTYQfCELEIELILAHVNHKQRIESDWEEKELRKLAAEAE 
55 LPIYISNFSGEFSEARARNFRYDFFQEVMKKTGATALVTAHHADDQVETIFMRLIRGTRLRYL5GIKEKQVVGEIEIIRPFL 
HFQKKDFPSIFHFEDTSNQENHYFRNRIRNSYLPELEKENPRFRDAILGIGNEILDYDLAIAELSNNINVEDLQQLFSYSES 
TQRVU.QTYLNRFPDLNLTKAQFAEVQQILKSKSQYRHPIKNGYELIKEYQQFQICKISPQADEKEDELVLHYQNQVAY 
QGYLFSFGLPLEGELIQQIPVSRETSIHIRHRKTGDVUKNGHRKKLRRLFIDLKIPMEKRNSALIIEQFGEIVSILGIATNNL 
SKKTKNDIMNTVLYIEKIDR2 

60 

mrkflhlllpsfltlsicvvstekevvytskeiyyl5qsdfglyfr£klsspmvygevpvyanedlvvesgkltplctsfqrr 
ewrlnkqgipvfklsnhqflaadkrflydqsevtptnckvwlesdfklynspydlkevksslsaysqvsidjctmfvegre 
flhidqagwvakestseednrmskvqemlsekyqkdsfsiyvkqlttgkeaginqdekmyaasvlklsylyytqekin 
eglyqldttvkyvsavndfpgsykpecsgslpkkednkeyslkdlitkvskesdnvahnllgyyisnqsdatfkskmsa 
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rmgddwdpkeklisskmagkfmeaiynqngfvlesltktdfdsqrlakcvsvkvahkigdadefkhdtgvvyadspfil 
siftknsdydtisk1akdvyevuc2 

mkkqnngliknpflwllfrfflvtcfqyfysgnnsggsqqinyteuvqerrixinvkeltyqpngsvievsgvyknpktsk 
5 eetgiqfftpsvtkvekftstilpadttvselqki^tdhkaevtvkhesssgiwinllvsrvpfgilffflfsmmgn^ 
ngrjs^pmsfgi^icakaa^ncedikvrfsdvagaeeekqelvevveflkdpkrftklgaripagvllegppgtgktl 
vageagvpffsiscsdfvemfvgvgasrvrslfedakkaapahfideidavgrqrgvglgggndereqtlnqluemdg 
fegnegiiviaatnrsdvldpallrpgrfdrkvlvgrpdvkgreailkvhaknkplaedvdlklvaqqtpgfvgadlen 
vlneaalvaarrnksiidasdideaedrviagpskkdktvsqkerelvayheaghtivglvlsnarvvhkvtivprgra 
10 ggymialpkedomllskedmkeqlaglmggrvaeeiifnvqttgasndfeqatqmaramvteygmseklgpvqyeg 
nhamlgaqspqksiseqtayeideevrsllnearnkaaeiiqsnrethkliaeallkyetldstqikalyetgkmpeave 
eeshalsydevkskmndek2 

mkrssllvrmvisiflvflillalvgtfyyqssssaieatiegnsqttisqtshfiqsyikklettstgltqqtdvlayaen^ 
15 sqdkvegirdlflmksdkductvvlvtksgqvistddsvqmktssdmmaedwyqkaihqgampvltparksdsqw 
visvtqelvdakganlcvlrldisyetleaylnqlqlgqqgfafiinenhefvyhpqhtvysssskmeamkpyidtgqg 
ytpghksyvsqekugtdwtvlgvsslekldqvr5qllwtllgasvtsllvclclvwfslkrwiaplkdlretmleias 
gaqnlrakevgayelrevtrqfnamldqidqlmvairsqeettrqyqlqalssqinphflyntldtiiwmaefhdsqr 
vvqvtkslatyfrlaln0gkdlicl5deinhvrqylfiqkqrygdkleyeinenvafdnlvlpklvlqplvenalyhgi 
20 xekegqghiklsvqkqdsglvirjeddgvgfqoagossqsqlkrggvglqnvdqrljclhfganyhmkrdsrpqkgtkv 
eiyinriets2 

mkrssllvrmvisiflvfullalvgtfyyqssssaieatiegnsqttisotshfiqsyikklettstgltqqtdvlayaenp 
s0dkvegirdlfltiu3dkdlktvvlvtksgqvistddsvqmktssdmmaedwyqkaihqgampvltparksdsqw 

25 visvtqelvdakganlgvlrldisyetleaylnqlqlgqqgfafiinenhefvyhpqhtvysssskmeamkpyidtgqg 
ytpghk5yvsqek1agtdwtvlgvsslekldqvrsqll\\tllgasvtsllvclclvwfslkrwuplkdlr£tml^ 
gaqnlrakevgayelrevtrqfnamldqidqlmvairsqeettrqyqlqalssqinphflyntldtiiwmaefhdsqr 
vvqvtkslatyfrlalnqgkdliclsdeinhvrqylfiqkqrygdkleyeinenvafdnlvlpklvlqplvenalyhgi 
kekegqghiklsvqkqdsglvirieddgvgfqdagdssqsqlkrggvglqnvdqrlklhfganyhmkidsrpqkgtkv 

30 eiyinrietsz 

mffkllrealkvkqvrskilftifrvlvfrlgtsrrvpgvnanslnal^glsflnmlslvsgnai-krn^sifalgvspyitasi 
vvqllqmdilpkfvewgkqgevgrrklnqatrylalvl^fvqsigitagfnrrlagaquktaltpqvfltignltagsmi 
vtwlgeqrrdkgygngvsmiifagivsslpemiqgryvdyfvnvpssritssiifviu,irtvluiyfttyvqqaeyklplqytk 
35 vaqgapsssylplxvnpagvipvifassrraapaallqflsatghdwawvrvaqemlattsptglamyalliilftffytf 
vqinpekaaetykrvvpismefvlvkvqkniclnffvvlqllvpsslv2 

mdirqvtetlamieeqnfdirtrrmgislldcidpdinraaeklyqkritk.\anlvavgdelaaelgipivnkrvsvtpisug 
aatdatdyvvlakaldkaakeigvdfiggfsalvqkgyqkgdeilinsipralaetdkvcssvnigstksginmtavad 
40 mgriiketanlsdmgvaklwfanavednpfmagafhgvgeadviinvgvsgpgvvkralekvrgqsfdvvaetvkk 
tafkitrjgqlvgqmaserlgvefgrvdlslaptpavgdsvarvleemgletvgthgttaalallndqvkkggvmac 
nqvggl^gafipvsedegmiaavqngslnleklean^aicsvgldmiaipedtpaetiaamiadeaaigvinmkttavrii 
pkgkegdmiefggllctapvmkvngassvdfisrggqipapihsfkn2 

45 mtqiidgkalaaklqgqlaektaklkeetglvpglvvilvgdnpasqvyvrnkersalaagfrsevvrvpetitqeell 
dliakynqdpawhgilvqlplpkhideeavllaidpekdvdgfhplnmgrlwsghpvmipstpagimemfheygidleg 
knavvlgrsnivgkpmaqlllaknatvtlthsrthnlskvaakadrlvvalgrakfvtadfvkpgavvidvgmnrden 
gklcgdvdyeavaplashitpvpggvgpmtitmlmeqtyqaalrtldrxz 

50 mskfnrihlvvldsvgigaapdannfvnagvpdgasdtlghisktvglnvpnmakiglgnipretplicwaaesnptgy 
atkleevslgkdtmtghweimglnrtepfdtfwngfpeeiltkieefsgrkvireankpysgtaviydfgprqmetgeui 
ytsadpvlqlaahediipldelyriceyarsrtlerpallgrllarpyvgepgnftrtanrrdlavspffptvldklneagl 
dtyavgkindifngaginhdmghnksnshgidtllktmglaefekgfsftnlvdfdalyghrrnahgyrdclhefde 
rlpeiiaamrendlllitadhgndptyagtdhtreyipllayspafkgngupvghfadisatvadnfgvetamigesfl 

55 DKLV2 

MFlSISAGIVTFLLTLVEIPAFIQFYRKAQrrGQQMHEDVKQHQAKAGTPTMGGLVFLrrSVLVAFFFALFSSQFSNNVGM 
ILFILVLYGLVGFLDDFLKVFRKINEGLNPKQKLALQLLGGVIFYLFYERGGDILSVFGYPVHLGFFYIFFALFWLVGFSN 
AVNLTEX3VDGLASISVVISL^AYGVlAYVQGQMDILLVILAMIGGLLGFFlFNHKPAKVFMODVGSLALGGMLAAlSMA 
60 LHQEWTLLIIGIVYVFETTSVMMQVSYFKLTGGKRIFRMTPVHHHFELGGLSGKGNPWSEWKVDFFFWGVGLLASLLT 
LAILYLM2 

LFKKNKDILNlALPAMGENFLQMLMGMVDSYLVAHLGLIAlSGVSVAGNimYQAIFIALGAAlSSVISKSIGQKDQSKLA 
YHVTEALKITLLLSFLLGFLSIFAGKEMIGLLGTERDVAESGGLYLSLVGGSIVLLGLMTSLGALIRATHNPRLPLYVSFL 
65 SNALNILFSSLAIFVLDMGIAGVAWGTIVSRJLVGLVILWSQLKLPYGKPTFGLDKELLTLALPAAGERLMMRAGDVVIIA 
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LVVSFOTEAVAGNAlGEVLTQFKYMPAFGVATATVMLLARAVGEDDWKRVASLSKQTFWI^LFLMLPL^FSnrVLGVP 
LTHLYTTDSLAVEASVLVTLFSLLGTPMTTGTVIYTAVWQGLGNARLPFYATSIGMWCIRICTGVLMGIVLGWGLPCIW 
AGSLLDNCFRWLFLRYRYQRYMSLKG2 

5 MCTTQEKHSQAAVLGLQHlXAMYSGSn-VPIMlATALGYSAEQLTYLlSTDIFMCGVATFLQLQLNKYFGIGLPVV^^ 

FQSVAPLIMIGQSHGSGAMFGAUASGIYVVLVSGIFSKVANLFPSIVTGSVITTIGLTLIPVAIGNMGNr^VPEPTGQSLLLA 
AnVUILLmiFTKGFIKSISlLIGLWGTAlAATMGLVDFSPVAVAPLVHVPTPLYFGMPTFEISSrVMMCIIATVSMVECT 
GVYLAL^DnXDPIDSTRLRNGYRAEGLAVLLGGlFNTFPYTGFSQKVGLVKLSGIKKRLPIYYAAGFLVLUSLLPKFGA 
LAQIIPSSVLGGAMLVMFGFVSIQGMQILARVDFANNEHNFLlAAVSlAAGVGLNNSNLFVSMPTAFQMFFSNCrVVASL 
10 LAIVLNAVLNHKKKZ 



MKDRlKEYLQDKGKVTVNDUiQALGKDSSKDFRELIKTLSLMERKHQIRFE£DGSLTLEDaCKHEITLKGIFR\HKNGFG 
FVSLEGEEDDLFVGKNDVNYAIDGDTVEWIKKVADRNKGTAAEAKIIDILEHSLTTVVGQIVLDQEKPKYAGYIRSKN 

15 QKISQPIYVKKPALiCLEGTEVLKVFIDKYPSKKHDFFVASVLDVVGHSTDVGIDVLEVLESMDIVSEFPEAVVKEAESVP 
DAPSQKDMEGRLDLRDErrFTIDGADAKDLDDAVHIKALKNGNLEFGVHlADVSYYVTEGSALDKEALNRATSVYVTD 
RVVPMLPERLSNGICSLNPQVDRLTQSAIMEIDKHGRVVNYTrrQTVUCTSFRMTYSDVNDILAGDEEKRKEYHKrVSSIE 
LMAKLHETLENMRVKRGALKTOTNEAKILVDICQGKPVDIVLRQRGlAERMIESFMLMANETVAEHFSKLDLPnYRIHE 
. £PiCA£KVQKHDYASSFGLRIYGTAS£ISQ£ALQDIMRAVEG£?YADVLSMMIXRSM(^ARY5EH^ 

20 HFTSPIRRYPDLLVHRMIRDYGRSKEIAEHFEQVIPEIATQSSNR£RRAIEAER£VEAMKKAEYMEEYVGE£YDAVVSSIV 
KFGLFVELPKTVEGUHrrNLPEFYHFNERDLTLRGEKSGrrFRVGQQ[RlRVERADKMTGEIDFSFVPSEFDV!EKGLKQS 
SRSGRGRDSNRRSDKKEDKRKSGRSNDKRKHSQKDICKKKGKKPFYKEVAKKGAKHGKGRGKGRRTKZ 

MGTTGFniDUILIVYLLAVLVAGIYFSKKEMKGKEFFKGDGSVPWYVTSVSCFATMLSPISFLGLAGSSYACSWILWFA 
25 QLGMVV/^IPLTn^LPIFARIDIDTAYDYLDKRFNSKALRJISALLFIIYQLGRMSIIMYLPSAGLSVLTGUDINIUIU^GVV 
AIVYSYTGGLKSVL>^DRQGVILISGVVLALFVLIANIKGGFGAVAETLANGKFLAANEKLFDPNLLSNSIFLIVMGSGF 
TII^SYASSQDLVQRJTTTQNIKKLNKMLFTNGVl^LATATVFYLIGTGLYVFYQVQNADSAASNIPQDQIFMYF 
PVGrrGUUUVrYAASQSTISTCLNSVATSWTLDIQDVISKNMSDNRRTKIAQFVSLAVGLFSIGVSrVMAHSDlKSAYEWF 
NSFMGLVLGLLGGVnLGFVSKKANKQGAYAALrVSTIVMVFIKYFLPPTAVSYWAYSUSISVSVVSGYIVSVLTGNKVS 
30 APKYTnHDrrEDCADSSWEVRHZ 

MKFSKKYlAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQrVKITDQCYV 
TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVIXJKYYVYLKDAAHADKVRTKDEINRQK 
QEHVKDNEKVNSNVAVARSQGRYTTNIXJYVFNPADirEDTGNAYIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNM 
35 QPSQLSYSSTASDNNTQSVAKGSTSJCPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHF 
IPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEVVSSLGSLSSNPSSLTTSKELSSASDGYIFNPKDP/EETATAYIVRHGD 
HFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIK 
VRKN12 

40 MKKRAIVAVrVLLLIGLDQLVKSYIVQQIPLGEVRSWIPNFVSLTYLQNRGAAFSILQDQQLLFAVlTLVVVIGArWYLHK 
HMEDSFWMVLGLTLIIAGGLGNFIDRVSQGFVVDMFHLDFINFAIFN^ADSYLTVGVIILLIAMLICEEINGNZ 

MNrrNl^FIVGLIIDENDRFYFVQKDGQTYALAKEEGQHTVGDTVKGFAYTDMKQKLRLTTLEVTATQDQFGWGR^ 
EVRKDLGVFVDTGLPDKEIVVSLDILPELICELWPICKGDQLYIRLEVDKKDRIWGLLAYQEDFQRL.ARPAYNNMQNQN 
45 WPArVYRLKLSGTFVYLPENNMLGFIHPSERYAEPRLGQVLDARVIGFREVDRTLNLSLKPRSFEMLENDAQMlLTYLE 
SNGGFMTLNDKSSPDDnCATFGISKGQFKKALGGLMlCAGKIKQDQFGTELrZ 

MKDVSLFLUCKVFKSRLNWrVl^LFVSVLGVTFYLNSQTANSHSLESRLESRlAANERAlNENEEKL^QMS^^^ 
AKNNLDVQKNLLTRJCTEILTLLKEGRWKEAYYLQWQDEEKNYEFVSNDPTASPGLKMGVDRERKIYQALYPLNIKAH 
50 TLEFPTHGIDQIVWrLEVIIPSLFVVAIIFMLTQLFAERYQNHUDTAHLYPVSKNT-FAlSSLGVGVGYVTVLFIGICGFSFLV 
GSUSGFGQLDYPYPIYSLVNQEVTIGKIQDVLFPGLIXAFLARVIVEVVYLIAYFFKQKMPVLFLSLIGIVGLLFGIQTIQP 
LQRIAHLIPFTYLRSVEILSGRLPKQIDr^DLNWSMGMVLLPCLIIFLLLGILFIERWCSSQKKEFFNRFZ 

MMmLDfVSTPAlLVALlAILGLVLOKKKLPDIIKGGIICTFVGFLVVSGGAGIVQNSLNPFGTMFEHAFHLSGVVPNNEAI 
55 VAVALTTYGSATAMIMFAGMVPNILlARFTRfKYIFLTGHHTLYMACMlAVILSVAGFTSLPULLGGLALGIIMSISPAF 
VQKYMVQLTGNDKVALGHFSSLGYWLSGFTGSUGDKSKSTEDIKFPKSLAFLRDSTVSrTLSMAVIYlIVAlFAGSEYIEK 
EISSGTSGLVYALQLAGQFAAGVFVILAGVRLILGEIVPAFKGISERLVPNSKPAHXrPIVYTYAPNAVLIGFlSSFVGGLVS 
MVIMlASGTVVILPGVVPHFFCGATAGVIGNASGGVRGATIGAFLQGlLISFLPVFLMPVLGGLGFQGSTFSDADFGLSGn 
LGMLNQFGSQAGIVIGLVULAVMFCVSFIKKPSATEEZ 

60 

MIKTFLSALSVILFSIPirrYSFFPSSNLNIWLSTQPlLAQIYAFPLATATMAAILSFLFFFLSFYKKNKQIRFV'SGILLl^ 
LLFGTDICTI^SASNKTKTIJCLVTWNVANQIEAQHIER1FSHFDADMAIFPELJ>lTNIRGEQENQR1KLLFHQVGLSM 
IFTSPPTNSGUPVTVrVKKSYGFYTEAKTFHTTRFGTIVLHSRKQNIPDUALHTAPPLPGLJvIEIWKQDLNIIHNQLASKY 
KAIIAGDFNATMRHGALAKISSHRDALNALPPFERGTWNSQSPKLFNATIDHILLPKNHYYVKDLDIVSFQNSDHRCIFT 

65 ErTFZ 
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MNPIQRSWAYVSRKRLRSFlLFLILLVLLAGISACLTLMKSNKTVESNLYKSL^^reFS^^^ 
E^^VSPELETVAiCUCDKEAVTGEQSVERDDLSAADNNLVSLTALEDSSKDVTFTSSAFNUCEGRHLQKCDSK^ 
AKKNGLSLHDKIGLDAGQSESGKGQTVEFEIIGIFSGKKQEKFTGLSSDFSENQVFTDYESSQTLLGNSEAQVSAARFYVE 
5 NPKEMDGLMKQVENLALENQGYQVEKE>fKAFEQ[KDSVATFQTFLTIFLYGMLlAGAGAULVLSLWLRERVYEVGa, 
LALGKGKSSIFLQFCLEVVLVSLGALLPAFVAGNArrTYLLQTLLASGDQASLQDTlJVKASSL^ 
CLSVALCFLFLFRKSPKEILSSISZ 

MLHNAFAYVTRKFFKSIVrFLIILLMASL^LVGL^IKGATAKASQETFKNITNSFSMQINRRVNQGTPRGAGNIKGEDnC^ 
10 TE^«CArESYVKRINAIGDLTGYDUETPETKKNLTADRAKRFGSSLMr^GVNDSSKEDKFVSGSYKLVEGEHLTNDDKDK 
ILLHKDLAAKHGWKVGDKVKLDSNIYDADNEKGAKETVEVTUCGLFDGHNKSAVTYSQELYEOTAn'DIHTAAKL 
TEDTAIYGDATFFSnrADKNLDDVMKELNGISCINWKSYTLVBCSSSNYPALEQSISGMYKMANtXFWGSLSFSVLLLALL 
LSLWINARRKEVGILL^IGLKQASILGQFrrESIUAlPALVSAYFLANYTARAlGNTVUVNVTSGVAKQASK^ 
GAEVDGFSKTLSSLDISIQTSDFIIIFVLALVLVVLVMALASSNLLRKQPKELLLDGEZ 

MSQDKQMKAVSPLLQRVINISSIVGGVGSLIFCIWAYQAGILQSKETLSAFIQQAGIWGPPLFIFLQILQTVVPIIPGALTSV 

AGVFIYGHIIGTIYNYIGrVIGCAIIFYLVRLYGAAFVQSVVSKRTYDKYIDWLDKGNRFDRFFIFMMIWPISPADFLCMLA 

ALTKMSFKRYMTIIILTKPFTLVVYTYGLTYIIDFFWQML2 

20 MRNMWW^KETYLRKVESWSFFFMV^SPFLFLGlSVGIGHL<y3SSMAK^WKVAVV^TVPSVAEGLKW 

ASAKEAIKEEKLKGYLTIDQEDSVLKAVYHCETSLENGIKFEVTGTLNELQNQLNRSTASLSQEQEKRLAQTIQFTEKIDE 
AKEfOCKnQTlA^GALGFFLYMILITYAGVTAQEVASEKGTKIMEVVFSSIRASHYFYARMMALFLVILTHIGrYWCGL 
AAVLLFKDLPFLAQSGILDHLCDAISLOTLLFILISLFMYVVUVAFLGSMVSRPEDSGKALSPLMILIMGGFFGVTALGA^ 
GDNLLLKIGSYIPFISTFFMPFRTINDYAGGAEAWlSLArrviFAVVATGFIGRMYASLVLQTDDLGIWKTFKRALSYKZ 

MTETIKLMKAHTSVRRFKEQEIPQVDLNEILTAAQMASSWKNFQSYSVIVVRSQEKKDALYELVPQEAIRQSAVFLLFV- 
GDLNRAEKGARLHTDTFQPQGVEGLUSSVDAALAGQNALLAAESLGYGGVIIGLVRYKSEEVAELFMLPDYTYSVFG 
MALGVPNQHHDMKPRLPLEhTVVFEEEYQEQSTEAlQAYDRVQADYAGARATTSWSQRLAEQFGQAEPSSTRKNLEQK 
KLLZMLKLlAlVGTNSKRSTNRQLLQYMQKHFTDKAEIELVEIKAIPVFNKPADKQVPAEILElAAiaEEADGVIICTPEYD 
30 HSIPAVLMSALAWI^YGIYPLL^^CPIMrTGASYGTLGSSRAQLQLRQILNAPEIKA^rVLPDEFLLSHSLQAFNPSGDLVDL 
DVDCKLDAIFDDFRIFVKITEKLRNAQELLRKDAEDFDWENLZ 

MhTTYQLNNGVEIPVLGFGTFKAKDGEEAYRAVLEALKAGYRHIDTAAlYQNEESVGQAnCDSGVPREEMFVaTKLWNS 
QQTYEQTRQALEKSIEKLGLDYLDLYLIHWPNPKPLRENDAWCTRNAEVWRAMEDLYQEGKIRAIGVSNFLPHHLDAL 
35 LETATIVPAVNQVRI^PGVYQDQVVAYCREKGILLEAWGPFGQGELFDSKQVQEIAANHGKSVAQULAWSLAEGFLP 
LPKSVTTSRIQANLDCFGIELSHEERETLKTIAVQSGAPRVDDVDFZ 

MRCKMLDPIAIQLGPlJVIRWYALCIVTGUl^VYLTMKEAPRKKIIPDDILDFILVAFPUVILGARLYYVIFRFDYYSQra- 
EIFAlWNGGLAIYGGLITGALVLYlFADRKUrn^DFLDLau\PSVMlAQSLGRWGNFFNQEAYGATVDNLDYLPGnRDQ 
40 MYIEGSYROPTFLYESLWNLLGFALILIFRRKWKSLRRGHITAFYLIWYGFGRMVIEGMRTDSLMFFGFRVSQWLSVVU 
„ GLGIMrVtYQNRKKAPYYITEEENZ 

MGKLSSILLCTVSGAALALFLTSDKGKQVCSQAQDFLDDLREDPEYAKEQVCEKLTEVKEQATDFVLKTKEQVESGErr 
VDSILAQTKSYAFQATEASKNQLNNLICEQWQEKAEALDDSEEIVIDITEE2 

MFTTKLIFWGSMLFLL^L^ILLTIYLAWIFYPMEIQWLNLTNRVYLKPETIQYNFHILMNYLTNPFSQVLQMPDFRSSAAG 
LHHFAVVKNLFHLVQLVALVTLPSFYVFVNRJVKKDFLSLYRKSLLALVVLPVMIGLGGVLIGFDQFFTLFHQILFVGD 
DTWLFDPAKDPVIMILPETFFLHAFLLFFALYENFFGYLYLKSRRKZ 

50 MTYHFTEEYDriVIGAGHAGVEASLAASRMGCKVLlJVTINIEMLAFMPCNPSIGGSAKGrWREVDALGGEMAJm 

YIQMKMLNTGKGPAVRALRAQADKELYSKEMRKTVENQENLTLRQTMIDEILVEDGKVVGVRTATHQEYAAKAVIVT 
TGTALRGEIIIGDLKYSSGPNHSLASINLADNLlCELGLEIGRFKTGTPPRVKASSINYDVTEIQPGDEVPNHFS'rrSRDEDY 
VKDQVPCWLTYTNGTSHEnQNNLHRAPMFTGVVKGVGPRYCPSIEDKIVRFADKERHQLFLEPEGRNTEEVYVQGLST 
SLPEDVQRDLVHSIKGLENAEMMRTGYAIEYDMVLPHQLRATLETKKISGLFTAGQTNGTSGYEEAAGQGtlAGrNAAL 

55 KIQGKPELILKRSDGYIGVMIDDLVTKGTIEPYRLLTSRAEYRLILRHDNADMRLTEMGREIGLVDDERWARFEIKKNQF 
DNEMKRLDSIKLKPVKETNAKVEEMGFKPLTDAVTAKEFLRRPEVSYQDVVAFIGPAAEDLDDKIIEUETEtKYEGYtSK 
AMDQVAKMKRMEEKRJPANIDWDDIDSIATEARQKFKLINPETIGQASRISGVNPADISILMVYLEGKNRSISKTLQKSKZ 

N^KQVLLVDDEEHILKLLDYHLSKEGFSTQLVTNGRKALALAETEPFDFILLD[MLPQLDGMEVCkRLRAKGVlCrPl^ 
60 MVSAKSDEFDKVLALELGADDYLTKPFSPRELLARVKAVLRRTKGEQEGDDSDNIADDSWLFGTLKVYPERHEVYKA 
NKLLSLTPKEFESDKNPFFEVFKVSKVTAQZ 

MTTFKDGFLWGGAVAAHQLEGGWQEGGKCISVADVMTAGRHGVAREITLGVLEGKYYPNHEAIDFYHRYKEDIALF 
AEMGFKCFRTSIAWTRIFPKGDELEPNEEGLQFYDNLFDECLKNGIEPVITLSHFEMPYHLVTEYGGWKNRKUDFFARF 
65 AEVVFKRYKDKVKYWMTFNEINNQANYQEDFAPFTNSGIVYEEGDNREAIMYQAAHYELVASARAVKIGHEINPDFQI 




GCMIAMCPIYPVTCNPKDILMAMKAMQKRYYFADVHVLGKYPEHIFKYWERKGISVDFTAQDKEDU^GTV 
YYMSFAlDSHRENNPYFDYLETEDLVKNNYVKASEWEWQrDPECLRYALhnVFTOHYHLPLFTVENGFGAlDQVAADG 
MVHDDYRmYLGAHlREMKKAWEDCVDU^GYTPWGCIDLVSAGTGEMRKRYGFlYVDKDDNGKGSYNRSPKKSFG 
WYKEVISSNGESVEZ 

5 

MIXJQNGLFGFLENHVMGPMGKlJVQFKVVRArrAAGMAAVPFTrVGSMFLVFSILPQAFSFWPIVADIFSASFDKFrS 
MVANYATMGSLSLYFVl^I^YELTKIYAEEEELNMNPLNGALl^LMAFVMTVPQnFDGGMMiaVTSLKEGAVUM 
WAMGhA^ARFGTTGIFTAIIMArVTVLIYRMCVKHNWVnCMPEAVPEGVSRGFTALVPGFVVAFVVIFINGLLVAM 
10 DnTCVL^IPFGFVSNLTNSWIGLMHYLLTQlXWIVGIHCANrVFAFVSPlALANMAENAAGGHFAVAGEFSNMFVlAGGS 
GATLGLCLYIAFASKSEQLKAIGRASVVPALFNINEPUFGLPIIYNPAI^IPFIU^PMVTATIYYVANSLNFIKPIUQVPWP 
TPVGIGAFLGTADLRAVLVALVCAFAAFLVYLPFIRVYDQKLVKEEQGI2 

MKKFYVSPIFPILVGLL\FGVLSTFIIFVNNNLLTVLILFLFVGGYVFLFKKLRVHYTRSDVEQIQYVNHQAEESLTALLEQ 
15 MPVGVMKLNLSSGEVEWFNPYAELILTKEDGDFDLEAVQTIIKASVGNPSTYAKLGEKRYAVHMDASSGVLYFVDVSR 
EQAITDELVTSRPVIGIVSVDNYDDLEDETSESDISQINSFVANFISEFSEKHMMFSRRVSMDRFYLFTDYrVLEGLMNDK 
FSVIDAFREESKQRQLPLTLSMGFSYGIXjNHDEIGKVALLNLNLAEVRGGDQWVKENDETKNPVYFGGGSAASIKRT 
RTRTRAMMTAISDKIRSVDQVFVVGHKNLDMDALGSAVGMQLFASNVIENSYALYDEEQMSPDIERAVSFIEKEGVTK 
LL^VKDAMGMVrrNRSLULVDHSFCTALTl^KEFYDLFTQTIVIDHHRRpQDFPDNAVITYIESGASSASELVTELlQFQNS 
20 KKNRl-SRMQASVl.MAGMMLDTKNrrSRVTSRTFDVASYLRTRGSDSlAlQElAATBFE£yR£VNEUL<K;RK 

AEAKDMKCYDTVVIS^O^ADAMLAMSGIEASFVI^KNTQGFlSISARSRSKLNVQRIMEELGGGGHFNLAAAQrKD 
EAGEKLTEIVLNEMKEKEKEE2 

MKEKNMWKELLNRAGWILVFLl^VLLYQVPLVVTSILTLK£VALLQSGUVAGLSiVVUVLFIMGARiCTKLASFNFSF 
25 RAKDLARLGL^YLVrVGSNILCSILLQL5NETTTANQSQINDMVQNSSLISSFFLLALljaLPICEEILCRGIVPKKIFRGKENL 
GFVVGTrVFALLHQPSNLPSLLIYGGMSTVLSWTAYKTQRLEMSILLHMIVNGIAFCLLALVVIMSRTLGISVZ 

MKEKNMWKELLNRAG^V^LVFLLAVLLYQVPLVV^:SlLTLKEVALLQSGUVAGLSIVV^^LFIMGARKTKLASFNFSFF 
RAKDLARLGLSYLVIVGSNILGSILLQLSNETTTANQSQINDMVQNSSLISSFFLLALLAPICEEILCRGIVPKKIFRGK^^ 
30 GFVVGTTVFALLHQPSNLPSLLIYGGMSTVI^WTAYKTQRLEMSILLHMIVNGIAFCLLJVLVVIMSRTLGISVZ 

MDTQKIEAAVKMIIEAVGEDANREGLQETPARVARMYQEIFSGLGQTAEEHLSKSFEIIDDNMVVEKDIFFHTMCEHHF 
LPFYGRAHIAYIPDGRVAGL^KI^RTVEVYSKKPQIQERLNIEVADALMDYLGAKGAFVVIEAEHMCMSMRGVRKPGT 
ATLTTVARGLFETDKDLRDQAYRLMGIJIMICDLFLKRKQAFRKECLGYLRYVLNDHFVLFLLVLLGFUVYQYSQLLQH 
35 FPENHWPILLFVGnrSVLLLLWGGTATYMEAPDKLFLLVGEEEIKLHLKRQTGtSLVFWLFVQTLFLLLFAPLFLAMGY 
GLPVFLLYVLLLGVGKYFHFCQKASKFFTETGLDWDYVISQESKRKQVLLRFFALFTQVKCISNSVKRRAYLDFILICAV 
QKVPGKIWQNLYLRSYLRNGDLFALSLRLLLLSLLAQVFIEQAWIATAVVVLFNYLLLFQLLALYHAFDYQYLTQLFPL 
DKGQKEKGLQEVVRGLTSFVLLVELVVGLITFQEKLALLALLGAGLVLLVLYLPYQVKRQMQDZ 

40 MRKSrVU^DNAYUPLETTIKSVLYHNRDVDFriLNSDIAPEWFKLLGRKMEVVNSTIRSVHIDKELFESYKTGPHINYA 
SYFRFFATEWESDRVLYLDSDIIVTGELATLFETDLKGYSIGAVDDVYAYEGRKSGFNTGMLLMDVAKWKEHSP/NSL 
LELAAEQNQVVHLGDQSILNIYFEDNWLALDKTYNYMVGIDIYHLAQECERI-DDNPPTTVHYASHDKPWNTYSISRLRE 
LWWVYRDLDWSEIAFQRSDLNYFERSNQSKKQVMLVTWSADIKHLEYLVQRLPDWHFHLAAPCDCSEELTSLSQYTN 
VTVYQNVLHSRIDWLLDDSIVYLDINTGCEVFNVVTRAQESGKKIFAFDrTRKSMDDGLYDGIFSVERPDDLVDRMKNI 

45 EIEZ 

MTKIYSSIAVKKGLFTSFLLFIYVLGSRIILPFVDLOTKDFLGGSTAYI^FSAALTGGNLRSL^IFSVGI^PWMSAM 
MFSFSKRLGLTSTSIEIQDRRKMYLTLLUVIQSLAVSLRLPVQSSYSAILVVLMOTIIXIAGTFFLVWLSDLNASMGIGG 
VlLLSSMVLNIPQDVLETFQTVHIPTGIIVLLALLTLVFSYLLALMYRARYLVPVNiaGLHNRFKRYSYLEIMLNPAGGMP 
50 YMYVMSFl^VPAYLnLLGnFPNHSGLAALSKEFMVGKPLWVYVYISVLFLFSIIFAFVTMNGEELADRMKJCSGEYrYGI 
YPGADTSRHNRLVLRFSVIGGLF^A^IMAGCPMLFVLFDEKLLRI^MIPGLFMMFGGMrFTIRDEVKALRLNETYRPLI2 

MSSLSDQELVAKTVEFRQRI^EGESLDDILVEAFAVVREADKRlLGMFPYDVQVMGArVMHYGNVAEMhTTGEGKTLT 
ATMPVYLNAFSGEGVMVVTPNEYL^KRDAEEMGQVYRFLGLTIGVPFTEDPKKEMKAEEICKLIYASDirrrrNSN 

55 YLNDNLASNEEGKFLRPFNYVIIDEIDDILLDSAQTPLIIAGSPRVQSNYYAIIDTLVTTLVEGEDYIFKEEKEEVWLTTKG 
AKSAENFLGIDNLYKEEHASFARHLVYAIRAHKLFTKDKDYIIRGNEMVLVDKGTGRLMEMTFCLQGGLHQAIEAKEHV 
KL^PETRAMASrrYQSLFKMFNKISGMTGTGKVAEKEFIETYNMSVVRlPTNRPRQRIDYPDNLYITLPEKVYASLEYIKQ 
YHAKGNPLLVFVGSVEMSQLYSSLLFREGIAHNVLNANNAAREAQIISESGQMGAVTVATSMAGRGTDIKLGKGVAEL 
GGLrviGTERMESQRJDLQIRGRSGRQGDPGMSKFFVSLEDDVIKKFGPSWVHKXYKDYQVQDMTQPEVLKGRKYRKL 

60 VEKAQHASDSAGRSARRQTLEYAESMNIQRDIVYKERNRLIDGSRDLEDVVVDIIERYTEEVAADHYASRELLFHHVTN 
ISFHVKEVPDYIDVTDCTAVR5FMKQVIDKELSEKKELLNQHDLYEQFLRLSLUCAIDDNWVEQVDYLQQLSMAIGGQS 
ASQKNPIVEYYQEAYAGFEAMKEQIHADMVRNLLMGLVEVTPKGEIVTHFPZ 

MIGTFAAALVAVLANFIVPIErrPNSANTEUPPDGlGQVLSNLLLKLVDNPVNALLTANYIRJL^WAVIFGUMREASKNS 
65 QELLKTUDVTSKIVEWnNLAPFGU-GLVFKTISDKGVGSLANYGlLLVLLVTTMLFVAPVVNPLIAFFFMRRNPYPLVW 
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NCLRVSGVTAFFITlSSATNIPVNMKLCHDLGLNPDTYSVSrPLCSTINMAGVAmm-LTL^ 
VAAISSCDASGlAGGSLLUPVACSLFGISNDLAlQrVGVGFVIGVIQDSCETALNSSTDVLFTAVAEYAATRKKZ 



5 MSISQRTTKUl^TClJ^CLLAYFLha^AVSAGIlALL5L^DTRRSTLKLAIWR^ 

LAFrVPLAYKMGWEIGrn>STVLVSHLLVQESrSPDLXVNEFU-FAlGTGFALLVNLYMPSR£EEIQHYHTLVEE^ 

LQRFKYYLSRGDGRNRAQLVAEU5TLLK£ALRLVYLDHSDHLFHQTDYHIHYFEMRQRQSRlLRNMAQQl>n^CHL^ 

ESLILAQLFSKIAGQLSQTNPASDIXDEIERYI^VFRNRSLPKTRHEFETRATLXQLLREAKTFIQVKVDFYQKYRQZ 

10 MEIMSLAL\VFAVnGLVIGYVSISAKMKSSQEAAELMLLNAEQEATNLRGQAER£ADLLVNEAXR£SKSLKKEALL£AK 
EEARKYREEVDAEFKSERQEUCQIESRLTERATSLDRKDDNLTSKEQTLEQKEQSISDRAK^n-DARHEQLEEVERQKEAE 
LERIGAl^QAEARDnLJ>LQTEEm,TREUSRIREAEQEVKERSDKMAKDlLVQAMQRIAGEYVAESTNSTVHLPDDTMKG 
RIIGRHGRNIRTFESLTGVDVIIDDTPEVVTLSGFDPIRRElARMTMEMLLKIXjRIHPARIEELVEKNRQEIDhnCIREY 
AAYEIGAP^n.HPDU4KIMGR^,QFRTSYGQNVLRHSIEVAKLAGlMASELGENAAU^RRAGFLHDIGKAlDHEVE 

15 IGMELARKYKEPPWVNT1ASHHGDVEAESVL\V!VAAADALSAARPGARSESL£SYIKRLHDLEE1ANGFEGVQTSFAL 
QAGREIR1MVNPGKIKDDKVT1LAHKVRKKIENNLDYPGNIKVTVIRELRAVDYAK2 

MMLKPSIDTLLDKVPSKYStVILEAKRAHELEAGAPATQGFKSEKSTLRALEEIESGhTVTIHPDPEGKREAVRRRIEEEKR 
RKEEEEKKKEQIAKEKEDGEKIZ 

20 

MSAYQLPTVWQDEASNQGAFTGLNRPTAGARFEQNLPKGEQAFQLYSLGTPNGVXVTILLEELLEAGFKEAAYDLYKI 
AIMDGEX?FGSDFVKLNPNSKIPALLDQSCTENVRVF£SAHILLYLAEKFGAFLPSNPVEKVEVLNWLFWQAGAAPFLG 
GGFGHFFNYAPEKLEYPINRFTMEVKRQLDLLDKELAQKPYIAGNDYTIADIAIWSWYGQLVQGNLYQGSAICFLDASS 
YQNLVKWAEKXANRPAVKRGLEVTYTEIK2 

U^LirsnMFYVGFDVLRDTIQKIL^REETVIDPLGATLGIISAAlMFVVYLYNTTRLSKKSNSNALKAAAra 
GTAL\IIJ»LSSFNYPIVDKLVAnrrFnLKTAYDinESSFSLSDGFDDRLLEDYQKAlMEIPKISKVKSQRGRTYGSNIYU)^ 
LEMNPDLSVFESHEUDQVESMLEERFGVFDTDVHIEPAPIPEDEILDNVYKKLLMREQLIDQGNQLEELLTDDFVYIRQ 
DGEQMDKEAYKTKKELNSAlKDIQrTSISQFOTCLICYELDGIIHTSnVRRHETWQNIFHQETK^ 

30 

NmKLVATDMDGTFLDGNGRFDMDRLKSLLVSYKEKGIYFAVASGRGFLSLEKLFAGVRDDIIFlAENGSLVEYQGQDL 
YEAT^^ROFYLATFEKUCTSPYVDrNKLLLTGKKGSYVLDTVDETYLKVSQHYNENIQKVASLEDr^DDIFKFn•NFrEE 
TLEDGEAWVNE^^VPGVKAMTTGFESIDIVLDYVDKGVA^VELVKKLG^^MDQVMAFGDNLNDLHMMQV 
NARPEILELAKTVIGHHKERSVIAYMEGL2 

35 

MADIKLIALDLDGTLLTTDFCRJLTDRTKETLQAARDRGIKVVLTTGRPLKAMDFFLHELGTDGQEDEYTITFNGGLVQ 
OTGEILDKTVFSYDDVARLYEETEKL^LPLDAlSEGTVYQIQSDQESLYAKFNPALTFVPVDFEDLSSQNrrYNKCVTAFA 
QEPLDAAIQKISPELFDQYEIFKSREMLLEWSPKNVHKATGLAKLISHLGIDQSQVMACCDEANDLSMIEWAGLGVAM 
QNAVPEVKAAANVVTPMT^fDEEAVAWAIEEYVUCE^fZ 

MESLULLLIANLAGLFLTWQRQDRQEKHL^KSLEDQADHLSEKJLDYRFDQARQASQLDQKDLEVVVSDRLQEVRIELH 
QGLTQVRQEMTDNLLQTRDKTDQRLQALQESNEQRLEQMRQTVEEKLEKTLQTRLQASFETVSKQLESVNRGLGEMQ 
TVARDVGALNKVl^GTKTRGn-GELQLCQIIEDIMTPAQYEREYATVENSSERVEYAIKLPGQGDQEYVYLPIDSKFPLA 
DYYRLEEAYETGDKDEIERCRKSLU^5VXRFARDIRNKYlAPPRTTNFGVLFVPTEGLYSErVRJ^JPVFFDDLRJl£EQIIVA 
45 GPSTl^ALLNSLSVGFKTLNIQKSADHISKTLASVICTEFGKFGGILVICAQKHLQHASGNIDELLNRRTLAIERTLRHreL^ 
GEPALDLLHFQENEEEYEDZ 

MKISHMKKDELFEGFYLIKSADLRQTRAGKNYLAFrFQDDSGEIDGKLWDAQPHNIEAFTAGKVVHMKGRREVYNNT 
PQVNQITLRLPQAGEPNDPADFKVKSPVDVKEIRDYMSQMIFKIENPVWQRIVRNLYTKYDlCEFYSYPAAiCTNHHAFET 
50 GLAYHTATMVRL^DAlSEVYPQLNKSLLYAGIMLHDl^KVIELTGPDQTEYTVRGMLLGHlALIDSErrKTVMELCIDD 
KEEVVLLRHVILSHHGLLEYGSPVRPRIMEAEIIHMIDNLDASMMMMSTAI^LVDKGEMTNKIFAMDNRSFYKPDLDZ 

MSEKAKKGFKMPSSYTVLUIIAIMAVLTWIPAGAHEGIYETQPQNPQGIWDVLMAPIRAMLGTHPEEGSLIKETS^ 
VAFnLMVGGFLGIVMKTGALDVClASIVKKYKGREKMULVU^^PLFALGGTTYGMGEETMAFYPLLVPVMMAVGFDS 
55 LTGVAnLLCSQIGCLASTLNPFATGlASATAGVGTGDGIVLRLIFWVTLTALSTWFVYRYADKIQKDPTKSLVYSTRKED 
LKHFNVEESSSVESTI^SKQKSVLFLFVLTFILMVl^npWTDLGVTIFDDFNTWLTGLPVIGNIVGSSTSALGTWYFPEG 
AMLFAFMGILIGVIYGLKEDKIlSSFMNGAADLLSVAUVAlARGIQVtMNDGMITDTILNWGICEGLSGLSSQVFrVVTYIF 
YLPMSFLIPSSSGLASATMGIMAPLGEFVNVRPSUrTAYQSA^GVLNLlAPTSGIVMGAl^LGRINlGTWWKFMGKLW 
HVVTIALLLLGTFLPFLZ 

60 

MSNSFVKLLVSQLFANIJ^DIFFRVTIUNIYUSKSVIATSLVPIUGISSFVASLLVPLVTKRI^LNRVLSLSQFGKTILI^^ 
VGMFTVMQSVAPLVTYLFVVAISILDGFAAPVSYAIVPRYATDLGKANSALSKfrGEAVQLIGWGLGGLLFATIGLLPTT 
CINLVLYIISSFLMLFLPNAEVEVLESETNLEI1XKCWKLVARNPRLRLFVSANLLE[FSNTIWVSSIILVFVTELLNICTESY 
WGYSrn-AYSIGIIISGLUFRLSEKFUUOCWEPQLFTPNLKTIQNPCLSLDPGWFLFSPNGCFLLDKKEFPLYGIS^^ 
65 RKETHMNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYLVTNDQRRYCRYSDSDILVQFLFQLL 
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TGYGTDYACKELSADAYFPKLLEGGQLASQPTLSRFLSRTDEETVHSIJlCLNLELVEFFLQFHQLNQLrVDIDSTHF^ 
GKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEEADSFirPVLERFNQLLFRMDSGFATPKLYDUE 
KTGQYYLIKiJCK>m^LSRLGDLSLPCPQDEDLTILPHSAYSETLYOAGSWSHKRRVCQFSERKEGNLFTO 
GTSQIXJR^LYRGRGQAENFlKEMKEGFFGDKTDSSTLIKNEVRMMMSCUYNLYmJCHLAGGDFQTLTIKRFRHL^ 
5 HVVGKCVRTCRKQLUCLSSLYAYSELFSALYSRJRKVNLNLPVPYEPPRRKASLMMHZ 

MMEFFQQLPHLEPYGNPQYFVYVlAATLPIFIGLFFKKRPAWYEVLVSLFnVTMLVGGKTNQLAALGIYLCWE 
YKHYRKSKDGKWVFYLVSFLSLLPIIFVKVQPAINGTQSLLGFLGISYLTFRSVGIVIELRTCVKDFTLWEFLRFLLFMPT 
FSSGPIDRFKRFNENYQAIPERDELMDMLDESVRYIMWGFLYKRLAHVLGETLLPPLKNLALQSGGFFNLYALAVMYT 
10 FGLELFFDFAGYSMFALAISNLMGrRSPINFNKPFLSRDLKEFWNRWHMSLSFWFRDFVFMRMVMVLTRKKVFK^^^ 
VTSSMAYrVNMUMGFWHGVTWYYlAYGLFHGLGLVINDAWVRKKKTLNKERKKAGKi^ 

wmlsflrfsgflnnlwfkkz 

mlkrlwmifgpvll^gllvflliffyptemhhnlgaekrsavattidsfkersqkvralsdpratrfvpffgssewlrfd 
15 gahpavlaekynrsyrpyllgqggaaslnqyfgmqqmlpqlenkqvvyvispqwfskngydpaafqqyfngdqlts 
fuchqsgdqasqyaatrllqqfpnvamicdlvqklasiceel^adkemielljvrfnerqasffgqf 
akylkilpdqfsyqaledvvkadaekhn^nnemgmenyfyneqikkdlkklkdsoksftyucspeyndlqlvltqfsk 

SKVNPIFIIPPVNKKWMNYAGLREDMYQQTVOKIRYQLESQGFTNIADFSKDGGEPFFMKDTIHLGWLGWLAFDKAVD 
PFLSNPTPAPTYHLNERFFSKDWATYDGDV)CEFQZ 

MEKNLKO^LKQTTDOEGPAIEPEKAEDTKTVQNGYFEDAAVKDRTLSDYAGNWQSVYPFLEDGTFDQVFDYKAKLTG 
KMTQAEYKAYYTKGYHTD\n-KINrrDfn"MEFVQGGQSKKYTYKYVGmLTYKKGNRGVRFLFEATDADACQFKYV 
QFSDHNVAPVKAEHFHIFFGGTSQEALFEEMDNWPTYYPDNLSGQEIAQEMLAHZ 

25 MKDGHL^^HHIRLLNGR1FQKLL^QDPEALYRGEQGK1LJ^VLWNSETGCATATDIA1^TGLAN^^^.TTMIKKL^^ 
VIVSPCGKDKRKXYLVLTELGKSQKEVGHRVSQKLDTIFYKGFSEEEIHQFEGFQERILANUCEKGNEV2 

MTNLUTFQDRFSDWLTALSQHLOI-SLLTLLLAILLAIPLAVFLRYHEKLADWVLQUCIFOTIPSLALLGLFIPLMGIG^^ 
PALTALVIYAIFPILQ^^•^^GLKGIDPNLQ£AGUFGMTRWERLKKFEIPLAMPVIMSGIRTAAVL^GTATLAALIGAGCL 
30 GSFlLLGIDRNNASLILlGALSSAVLAUFNFLLKVMEICaLKLRTIFSGFALVALLLGLSYSPALLVQKEKENLVlAGFaGPEP 
EILANMYKLLIEEhrrSMTATVKPNFGKTSFLYEALKKGDIDIYPEFTGTVTESLLQPSPKVSHEPEQVYQVARIXiIAK^ 
HLAYLKPMSYQhTTYAVAVPKKIAQEYGLKTlSDLKKVEGQLKAGFTXEFNDREDGNKGLOSMYGLNLNVATIEPALRY 
QAIQSGDIQrrDAYSTDAELERYDLQVLEDDfCQLFPPYOGAPU^KEALUCKHPELERVLOTLAGKITESQMSQLNYQVG 
VEGKSAKQVAKEFLQEQGLLKKZ 

MMHTYLQKKIENIKTTLGEMSGCYRRMVAAMADLGFSGTMKAIWDDLFAHRSFAQWIYLLVLGSFPLWLELVYEHRI 
VDWIGMICSLTGIICVIFVSEGRASNYLFGLINSVIYLILJ^LQKGFYGEVLTTLYFTVMQPIGLLVWIYQAQFICKEKQEF^ 
ARKLDGKGWTKYLSISVLWWLAFGFIYQSIGANRPYRDSrrDATNGVGQIU^AVYREQWIFWAATNVFSIYLWWGES 
LQIQGKYUYUNSLVGWYQWSKAAKQNTDLLNZ 



MRNMKAKYAVWVAFFLNLTYAIVEFIAGGVFGSSAVLADSVHDLGDAIAIGISAFLETISNREEDNQYTLGYKRFSLLC 
ALVTAVILVTGSVLVILENVTKILHPQPVNDEGILWLGlLaLmNLLASLVVGKGKTKNESlLSLHFLEDTLGWVAVlLMAI 
VLRFTDWYILDPLLSLVISFFILSKALPRFWSTLKIFLDAVPECLDIKQVKSGLERLDNVASLNQLNLWTMDALEKNAIV 
HVCLKEMEHMETCKESIRJFLKDCGFQNrriEIDADLETHQTHKRKVCDLERSYEHQHZ 

MIEYKNVALRYTEKDVLRDVNLQIEIXJEFMVLVGPSGSGKTTMLKMINRLLEPTDGNIYMDGKRIKDYDERELRLSTC 
YVLQAULFPNLTVAENULIPEMKGWSKEErTKKTEELLAKVGLPVAEYGHRLPSELSGGEQQRVGIVRAMIGQPKIFL 
MDEPFSALDAISRKQLQVLTKELHKEFGMmFVTHDTDEALKLADRJAVLQDGEIRQVANPETILKAPATDFVADLFC 
GSVHDZ 



MSAVAISAMTKVMQETHGNPSSIHGHGRQAGKLLREARQELAQLLRTKPQHIFFTSGCTEGNNTTTIIGYCLRHQEQGKH 

irrTAlEHHAVLrriDYLVQHFGFEATIIQPENOEITAQQIQKALRDDTlLVSTMFVNNETGNLLPlAEIGQILKQHPAAYH 

VDAVQAIGKIPIHSEELGIDFLTASAHKFHGPKGIGFLYASSMDFDSYLHCGDQEQKKRAGTENLPAIVGMVAALKEDL 

EKQEEHFQHVQNLETAFLAELEGIQYYLNRGKHHLPYVLNIGFPGQKNDLLLLRLDLAGISISTGSACTAGVVQSSHVLE 

AMYGANSERLKESLRISLSPQNTVEDLQTLAKTLKEIIGGZ 



MLFKLSKEKIELGL^Rl^PARRIFL^FALVILLGSLLLSLPFVQVESSRATYFDHLFTAVSAVCVTGLSTLPVAHTYNrWG 

QIICLLLIQIGGLGLMTFIGVFYIQSKQKLSLRSRATI0DSFSYGETRSLRKFVYSIFLTTFLVESLGAILLSFR1.IPQLGWGR 

GLFSSIFLAISAFCNAGFDNLGSTSLFAFQTDLLVNLVIAGLIITGGLGFMVWFDLAGHVGRKKKGRLHFHTKLVLLLTI 

GLLLFCTATTLFLEWNNAGTIGNLPVADKVLVSFFQTVTMRTAGFSTIDYTQAHPVTLLIYILQMFLGGAPGGTAGGLK 

rTTFFVLLVFARSELLGLPHANVARRTlAPRTVQKSFSVFIIFLMSFLIGLILLGrrAKGNPPFIHLVFETiSALSTVGVTANL 

TPDLGKLALSVIMPLMFMGRIGPLTLFVSLADYHPEKKDMIHYMKADISIGZ 
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MSDRTIGILGLGrFGSSVLj\ALAKQDMNIUIDDHA£RINQFEPVLARGVIGDrTDEELLR5AGIDTCDTVVVATGENLESS 
VLAVMHOCSLGVPTVlAKVKSQTAKKVLEKIGADSVISPEYEMGQSLAQTILFHNSVDVFQLDKNVSrVEMKIPQSWAG 
QSLSKLDLRGKYNLNILGFR£QENSPLDVEFGPDDLLIC\DTYILAVINNQYLDTLVALNS2 

MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGIUINDGSQDQTQElAECLASKYPNrVRAlY0ErOCCHGGAVNRGLVE>^ 
GRYFKVVDSDDWVDPRAYLXILETLQELESKGQEVDVFVTNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSK 
GQ YTMMHS LI YRTDLLRASQF2 



MKFNPNQRYTRWSIRRLSVGVASVVVASGFFVLVGQPSSVRADGLNPTPGQVLPEETSGTKEGDLSEKPGDTVLTQAKP 
EGVTGOTNSLPTPTERTEVSEETSPSSLDTLFEKDEEAQKNPELTDVLKETVDTADVDGTQASPAErrPEQVKGCVKEN 
TKDSIDVPAAYLEKAEGKGPFTAGVNQVrPYELFAGDGMLTRLLLKASDNAPWSDNGTAJCNPALPPLEGLTKGKYFYE 
VDLNG^^^GKQGQAUDQLRANGTQTYKATVKVYGNKDGKADLTNLVATK^^VDININCLVAKETVQ^u\VADN^ 
15 IDVPAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFYQV 
AlX^GNVAGKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNIVATKKVTININGLISKETVQKAVADrA^KDSIDV 
PAAYLEKAKGEGPFTAGVNHVIPYELFAGIXjMLTRLLLKASDKAPWSDNGDAKNPALSPLCEm^KTKGQYFYQLAL^ 
GrA^AGKEKQAUDQFRANGTQTYSATVNVYGNKDGKPDLDNrVATKKVTININGLISKETVQKAVADNVKTVSMFQQP 
T2 

20 • - 

MKUCSYILVGYIISTLLmVVFWAVQKMUAKGEIYFLLGNrrrVASLVCAGISLFLLLPVFTSLGKLKEHAK 
SNLEVQGPVEFQQLGQTFNEMSHDLQVSFDSLEESEREKGLMIAQLSHDIKTPITSIQATVEGILDGIIKESEQAHYLATIG 
RQTERLNKLVEELNFLTLhTTARNQVETTSKDSIFLDKLLIECMSEFQFLIEQERRDVHLQVIPESARIEGDYAiaSRILVN 
25 LVDNAFKYSAPGTKLEVVAKLEKDQLSISVTDEGQGIAPEDLENIFKRLYRVETSRNMKTGGHGLGLAURELAHQLGG 
EITVSSQYGLGSTFTLVLNLSGSENKA2 

MFGQTAQHGLTNSLKDFWlFLLNIGPQl^FFCQMLRCSRSVEQGTGNHRREFNMIQQIFSHFGMTHLGQtKLVYQESID 
30 LEU,VNALNHHLLIDRi.VLTPNQmEIDRQIVHGLDLLKGRKDKEIIDIKSMFRQLELASTQQICPINQRVHHGlLAFGEIS 
DLVPAKNLPKRQD2 

MEHLATYFSTYGGAFFAALGrVLAVGL^GMGSAYGVGK^GQSAAAlXKEQPEKFASAULQLLPGTQGLYGFVIGIUW 
35 LQLTPELPLEKGVAYFFVALPlArVGYFSAICHQGNVAVAGMQlLAKRPlCEFMKGAILAAMVETYAlLAFVVSnLTLRVZ 

MLKSEKQSRYQMLNEELSFLLEGETNVLANLSNASALIKSRFPNTVFAGFYLFDGKELVLGPFQGGVSCIRIALGKGVC 
- GEAAHFQETVIVGDVTTYLNYISCDSLAKSEIVVPMMKNGQLLGVLDLDSSEIEDYDAMDRDYLEQFVAILLEKTAWD 
40 FTMFEEKS2 

MSVLEnCDLHVEIEGKEILKGVNLTLCTGElAAIMGPWTGKSTLSAAlMGNPNYEVTKGEVLFDGVNILELEVDERAR 
MGLFLAMQYPSEIPGITNAEFLRAAMNAGKEDDEKISVREFITKLDEKMELLNMKEEMAERYLNEGFSGGEKKRNEIL 
45 QI-i-MLEPTFALLDEIDSCLDIDALICVVSKGVNAMRGEGFGAMIITHYQRLl.NYrrPDVVHVMMEGRWLSGGPELAAR 
LEREG Y AKL AEELG Y D YKEEL2 



MPYKRQRSFSMALSKLDSLYMAVVADHSKNPHHQGKLEDAEQlSLNNPTCGDVINLSVKFDAEDRLEDIAFLNSGCnS 
TASASMMTDAVLGKTKQEILELATIFSEMVQGQKDERQDQLGDAAFL5GVAKFPQRIKCATLAWNALKKTIENQEKQZ 



MKIQDLLRXDVMLLDLQATEKTAVIDEMIKNLTDHGYVTDFETFKEGILAREALTSTGLGDGIAMPHSKNAAVKEATV 
LFAKSNKGVDYESLDGQATDLFFMtAAPEGANDTHLAALAELSQYLMKDGFADKLRQATSADQVIELFDQASEKTEEL 

DO VQAPANDSGDFIVAVTACTTGIAHTYMAQEALQKVAAEMGVGIKVETNGASGVGNQLTAEDIRKAKAIIIAADKAVEM 
DRPDGKPLINRPVADGIRKTEEUNLALSGDTEVYRAANGAXAATASNEKQSLGGALYKiiLMSGVSQMLPFVIGGGIMI 
ALAFLIDGALGVPNENLGNLGSYHELASMFMKIGGAAFGLMLPVFACYVAYSlAEKPGLVAGFVAGAlAKEGFAFGiaP 
YAAGGEATSTLAGVSSGFLGALVGGFiAGALVLAIKKYVKVPRSLEGAKSILLLPLLGTILTGFVMLAVNIPMAAINTAM 
NDFLGGLGGGSAVLLGIVLGGMMAVDMGGPVNKAAYVFGTGTLAATVSSGGSVAMAAVMAGGMVPPLAIFVATLLF 

OU KDKFTKEERNSGLTNIIMGLSFtTEGAIPFGAADPARAlPSFILGSAVAGGLVGLTGnCLMAPHCGIFVlALTSNALLYLVS 
VLVGAIVSGVVYGYLRKPQA2 



MANK^^^STTRRRPSKAELERKEAIQRMUSLGIAILLIFAAFKLGAAGr^LYNURLLVGSLAYLAIFGLLIYLFFFKWtRK 
QEGLl^GFFriFAGLLLIFEAYLVWKYGLDKSVLJCGTMAQVVTDLTGFRTTSFAGGGUGVALYIPTAFLFSNIGTYFIGS 
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ILILVGSLLVSPWSVnrDIAEFFSRGFAKWWEGHERRKEERFVKQEEKARQKAEKEARtEQEETEKALU^LPPVDM 
ILTEEAVQNLPPIPEEKWVEPEIlLPQ>V£LKFP£QEDDSDDEDVQVDFSAlCEALEVFCLPSLQLFAPDKPKDQSKEKKrVR£ 
NIKILEATFASFGIKNTTVERAEIGPSVTKYEVKPAVGVRVNRISNL^DDLALALAAKDVRIEAPIPGKSUGm 
SFRELWEQSQTKAENFLEIPLCKAVNCTARAFDLSKMPHLLVAGSTCSGKSVAVNGIUSn^lfCARPDQVKFMMVDPK 
5 MVELSVYNDlPHIXIPVVTNPRKASKALQKVVDEMENRYELFAKVGVRNlAGFNAKVEEFNSQSEYKQIPLPFIVVrVDE 
LADLMMVASICEVEDAZIRLGQKARAAGIHMILATQRPSVDVISGLIKANVPSRVAFAVSSGTDSRTILDENGAEKLLGRG 
DMLFKPIDENHPVRLQGSFISDDDVERIVNFIKTQADADYDESFOPGEVSENEGEFSDGDAGGDPLFEEAKSLVIETQKA 
SASMIQRRLSVGFNRATRLMEELEIAGVIGPAEGTKPRKVLQQ2 

10 

MSYFKKYKFDKSQFKLGMRTFKTGIAVFLVLUFGFFGWKGLQIGALTAVFSLRESFDESVHFGTSRILGNSIGGLYALV 

FFU-^^•FFHEAFWVTLVVVPICTMLTIMTNVAMN^^KAGVIGGVAAMLI^TLS^PSGE^ 

YDIDRIRLFLEKKEK2 

15 

- M^nCSEHRHQLIRAL^^KNKIHTQAELOALLAENDIQVTQATLSRDIKNMNL^KVREEDSAYYVLNNGSISKWE 

MEDAL VWMRPVQHQVLLKTLPGLAQSFGSIIDTLSFPDAIATLCGNDVCLIICEDADTAQKCFEELKKFAPPFFFEEZ 



20 MKSlKLNALSYMGmVLNnFPILTGTYVARVLDRTDYGYFNSVDTIL^FFLPFATYGVYNYGLRAIShTVm 

FSSLFYLCUCTILTTAVYIU^YPUTTDNPIVKKVYLVMGIQLlAQIFSIEWVNEALENYSFLFYKTAFrRILMLVSIFL^ 
NEHDrvVYTLVMSLSTLINYLISYFWIKRDIKLVKIHLSDFKPLFLPLTAMLVFANANMLFTFLDRLFLVKTGlDVNVSY 
YTlAQRlVTVUGVVTGAlGVSVPRl^YYLGKGDKEAYVSLVIsfRCSRIFNFRIPLSFGLMVLGPNAlLLYGSEKYIGGGIL 
TSLFAFRTnLALDTILGSQILFTNGYEKRTT^YTVFAGLLNLGLNSLLFFNHIVAPEYYLLTrMLSETSLLVFYnFIHRK^ 

25 IHLGHIFSYTVRYSU^SLSFVAIYFUNFVYPVDMVINLPFUI^GLrVLLSAISYlSLLVFTKDSIFYEFLNHVLA^ 

sz ' 



MELFMKrrNYErmJCKSGLTNQQILKVLEYGENVDQELLLGDUDISGCRNPAVFMERYFQIDDAHLSKEFQKFPSFSIL 
30 DDCYPWDLSEIYDAPVU-FnCGNLDUJCFPKVAWGSRACSKQGAKSVEKVIQGLENELVIVSGLAKGIDTAAHMAAL 
QNGGKTlAVIGTGLDVFYPKANKRLQDYIGNDHLVLSEYGPGEOPLKFHFPARNRIIAGLCRGVIVAEAKMRSGSLn'CE 
RAMEEGRDVFAIPGSILDGLSDGCHHLIQEGAKLVTSGQDVLAEFEFZ 



35 MKQLTVEDAKQIELEILDYIDTLCKKHNINYIINYGTUGAVRHEGFIPWDDDIDLSMPREDYQRFINIFQKEKSKYKLLS 
LETDKNYFNNFIKn-DSTTKnDTRhrnCTYESGIFIDIFPIDRFDDPKVIDTCYKLESFKLLSFSICHKNIVYKDSLLKDWIRT 
AFWLLLRPVSPRYFANICIEKEIQKYSRENGQYMAFIPSKFKEKEVFPSCTFDKTIDLPFENLSLPAPEKFDTILTQFYGDY 
MTLPPEEKRPYSHEFHAYKLEDZ 

40 

MTKINHLTITQNKDLRDLVSDLTMTIQDGEKVAnGEEGNGKSTLLKILMGEALSDFTIKGNIQSDYQSLAYIPQKVPEDL 
KJCKTLHDYFFLDSIDLDYSILYRLAEELHFDSNRFASDOEIGNLSGGEALKIQLIHELAKPFEILFLDEPSNDLDLETVDW 
LKGQIQKTRQTVinSHDEDFLSETADTIVHLRLVKHRKEAETLVEHLDYDSYSEQRKANFAKQSQQAANNQRAYDKT 
MEKHRilVKQNVETALRATKDSTAGRI-LAKKMKTVLSQEKRYEICAAQSMTQKPLEEEQIQLFFSDIQPLPASKVL^ 
45 KENLSIDDRVLVQKLQLTVRGQEKIGIIGPNGVGKSTLLAKLQRLLNDKREISLGFMPQDYHKKLQLDLSPIAYLSKTGE 
tCEELQKIQSHLASLNFSYPEMQHQIRSLSGGQQCKLLLLDLVLRKPNFLLLDEPTRNFSPTSQPQIRKLFATYPGGLITVS 
HDRRFLKEVCSriYRMTEHGUCLVNLEDLZ 



50 MKPKTFYNLLAEQNLPLSDQQKEQFERYFELLVEWNEKINLTAITDKEEVYLKHFYDSIAPILOGUPNETIKLLDIGAGA 
GFPSLPMKILYPELDVTIIDSLNKRJNFLQLLAQELDLNGVHFYHGRAEDFAQDKNFRAQYDFVTARAVARMQVLSELT 
IPYLKVGGKLLAUCASNAPEELLEAKNALNLLFSKVEDNLSYALPNRDPRYITVVEKKKETPNKYPRKAGMPNKRPL2 



55 MSIKUAVDIDGTLVNSQKEITPEVFSAIQDAKEAGVKVVIATCRPIAGVAKLLDDLQLRDEGDYVVTFNGALVQETATG 
HEIISESLTYEDYLDMEFLSRiaGVHMHArrKDGIYTANRNIGKYTVHESTLVSMPIFYRTPEEMAGKEIVKCMFIDEPEI 
LDAAIEKIPAEFYERYSINKSAPFYLELLKKNVDKGSAITHLAEKLGLTKDETMAIGDEENDRAMLEVVGNPVVMENGN 
PElKKIAKYrrKTNDESGVAHAIRTWVLZ 

60 

MTWIILGVIAUVIFVIVSYNGLVKNRMQTKEAWSQIDVQLKRRNDLLPNLIETVKGYAKYECSTLEKVAELRNQVAAA 
TSPAEAMKASDALTRQVSGIFAVAESYPDLKASANFVKLQEELTNTENKISYSRQLYNSVVSNYNVKLETFPSNllAGMF 
GFKAADFLQTPEEEKSVPKVDFSGLGDZ 

65 



83 



MLFTXJlASNKRKTWILLLVFFU-LALVGYAVGYLnRSGLGGLVULIIGFIYALSMIFQSTEIVMSMNGAR^^ 
LYHVVEDMALVAQIPMPRVniDDPALNAFATGSNFKJNAAVAATSGLLAlMNREELEAVMGHEVSHIRNYDIRISTlAV 
ALASATTML^SMAGRMMWWGGAGRRJ^DDDRDGNCLElIMLVVSLLAIVtAPLAATLVQLAlSRQR^ 
^ RNPQGMINALDKLDNSKPMSRHVDDASSALYINDPKKGGGFQKLFYTHPPISERIERl-KQMZ 

MiaNIQEIRKOSEGLNFEQTLDLVDDLRARNQEILDVKDILAVGKVQYEDRMYFLDYQLSYTIVLASSRSMEPVELVES 
YPVTEVFMEGATNQLDQEVLDDDLVLPIENGELDLAESVSDNILLNIPIKVLTAEEEAGQGFISGNDWQINrrEEEYQAQ 
KAVKKEENSPFAGLQGLFDGDE2 

10 

MKRQLALWFSGGQDSTTCLFWVMQHYETVEAVTFAYGQRHHLEIQtTRElAICEQGlRHHILDMSLLGQITAQPDFATl 
HISYIPDKLCVESKSLKLYLFSYRNHGDFHENCINTIGKDLVNLLDPRYLEVWGKFTPRGGISIDPYYNYGKQGTKYEGL 
AEQRLFQHDLYPEKIDNRZ 

N^ETVEDKVSHSn^GLDILKGIVAAGAVISGTVATQTKVFTNESAVLEKTVEKTDALATNDTVVLGTISTSNSASSTSLSA 
SESASTSASESASTSASTSASTSASESASTSASTSISASSTVVGSQTAAATEATAKKVEEDRiCKPASDYVASVTrA^NLQSYA 
KRRKRSVDSIEQLLASIKNAAVFSGhn-IVNCAPAlNASLNlAKSETKVYTGEGVDSVYRVPIYYKLKVTNDGSKLTFIT^ 

20 VTYVNPKTNDLCNISSMRPGYSrYNSGTSTQTMLTLGSDLGKPSGVK^^Y^rDKNGRQVI^Y^^•ST^m■QGSGYTWGNG 
AQMNGFFAKKGYGLTSS^^^^VPr^GTDTSF^FTPYAARTDRJG^NYFNGGGKVVESSTTSQSL^QSlCSL5VS^ 
TSASASASTSASASASTSASASASTSASVSASTSASASASTSASA5ASTSASESASTSASASASTSASASASTSASASASTSASE 
SASTSASASASTSASESASTSA^ASASTSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASESASTST 
SASASTSASESASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSASASAST^ 

25 ASVSASTSASA5ASTSASASASTSASESASTSASASASTSASASASTSASASASTSASESASTSASASASTSASESASTSASASA 
STSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASESASTSTSASASTSASESASTSASASASTSA^ 
SASTSASASARQVRRPQPVHLNRHQPVRQPQQVLVHQLQHQRVHRLQHQPVPRLQRQPVRQLQQVPVLQSQHQQVLQ 
PQHRQVPRLQQAHQHLNQRRQAPQLQQVPVRQPQRRQVRQPQQVLVHQLQHQRVHRLRRQPVHQSQQVPVRQLPHQ 
QVPRLQQAPVRRLQQVLAPQPQPQPVRQPQQVSQRLNRHQRVRPLQQVLAPQPQRQQVHRLQRQRVRLNRHQRVRPL 

30 QQVLAPQPQRQQVHRX-QHQRVRPLQQVLAPQPQRQQVHRLQRQRVRLSQHQRVRQPQQAHQLLNLHQPVRQPQHRQ 
APQLQQVPVRQPQRRQVRRLQQVPVRQPQQVPVRQPQRRQVRRPQPVHLNRHQPVRQPQQVLVHQLQHQRVHRLQH 
QPVHQSQQVPVRQFRINKCLGFSKY2 

35 MGVETWFYSSICWLAIGLGSVWKFPYN^AANGGGGFLLIFLISmiGFPLLLAEFALGRSAGVSAlKTFGKLGK 

FIGWIGAFALFILLSFYSVIGGWILVYLGIEFGKLFQLGGTGDYAQLFTSIISNPAIALGAQAAFILLNIFIVSRGVQKGIERA 
SKVMMPLLFIVFVFIIGRSL^LPNAMEGVLYFUCPDFSKLTSTGLiYALGQSFFAL^LGVTVMLTYASYUDICKTNLVQSG 
ISIVAMNISISIMAGLAIFQARSPFNIQSEGCPSLLFTVLPQLFDKMPFGTIFYVLFLLLFLFATVTFSVVMLEINVDNITNQD 
NSKRAKWSVILGILTFVFGIPSALSYGVMADVHIFGKTFFDAMDFLVSNLLMPFGALYLSLFTGYIFKKALAMEELHLD 

40 ERAWKQGLFQVWLFLLRFFVSSFQSSSLWSSLPNLCNQKGLEZ 

MLKKWQLXDVIL1JVFL^IFFGGVFVGSGYVYNIL^LLLTPLGLQAFANEILFGLWCMAAPL\AIFVPRVGSAT^^ 
LAEVLYGSQFGLGALLSGFVQGLGSEFGFIVTKNRYESWLSLTANSIGITLVSFVYEYIICLGYYAFSLPFVLSLLVVRFISV 
45 YFFCTILVRAIVKLYHQFATGGKA2 

MVKVATQTPIISLFULILSLETSRPSiALTLSVVAFCILFMLYYRRFKMLAWMULAILPSFANYWAVQLHGDASQAVML 
GTRAFVTVCICLVFVSSVSLKELLLYLAQKGl^RSWSYAUVVFNSFPLIQQEIKSLKEACLLRGQELHFWSPLIYSKVLM 
50 TVFRWRKLYLRALSAHGYDEHAQLKNSYRTFYIPKKTKUYLLFFLLLQTSLFL2 

MRKHQLQVHKLTILSMMIALDWLTPIFRJEGMAPMSSVVNILAGIMMGPVYALAMATVTARRNTTTQGIPPI^^ 
FGALLAGLFYKYGRKFHYSALGEILGTCIIGSIVSYPVMVLFTGSAAKLSWFIYTPRFFGATLIGTAISFIAFRFUKQEFFK 
55 KVQGYFFSERIDZ 

MQEFTNPFPIGSSSLIHCITNEISCEMLANCILALGCKPVMADDSREVLDFTKQSQALFINLGHLSAEKEKAIRMAASYAN 
QSSLPMVVDAVGVTTSSIRKSLVKDLLDYRPTVLKGNMSEIRSLVGLKHHGVGVDASAKDQETEDLLQVLKDWCQTYP 
60 GMSFLVTGPKDLVVSKNQVAVLGNGCTELDWrrGTGDLVGALTAVFLSQGKTGFEASCLAVSYLNlAAEKIVVQGMG 
LEEFRYQVLNQLSLLRRDENWLDTIKGEVYE2 



MNHKIAILSDVHGNATALEAVIADAKNQGASEYWLLGDIFLPGPGANDLVALLKDLPITASVRGNWDDRVLEALDGQ 
YGLEDPQEVQLLRMTQYLMERMDPATIVWLRSLPLLEIOCEIDGLRFSISHNLPDKNYGGDLLVENDTEKFDQLLDAET 



% % 

84 



DVAVYGHVHKQLLRYGSQGQQIINPGSIGMPYFNWEALKNHRSQYAVIEVEDGELLNIQFRKVAYDYEAELELAKSKG 
LPFIEMYEELRRDDNYOGHNLELLASUEKHGYVEDVKNFFDFLZ 



5 MNVNQIVRIIPTLKANNRKLNETFYIETLGMKALLEESAFI^LGDQTGLEKLVLEEAPSMRTRKVEGRKKLARLIVKVE 
NPLEIEGILSKTDSIHRLYKGQNGYAFEIFSPEDDLILIHAEDDIASLVEVGEKPEFQTDLASISLSKFEISMELHLPTDIESF 
LESSEIGASLDFIPAQGQDLTVDhrrvnrWDl^MLKFLVNELDLASLRQKFESTEYFIPKSEKFFLGKDRNNVELW 



10 MKWKnKKIEEQIEAGIYPGASFAYFKDNQWTEFYLGQSDPEHGLQTEAGLVYDLASVSKVVGVGTVCTFLWEIGQLD 
IDRLVIDFLPESDYPDITIRQLLTHATDLDPFIPNRDLLTAPELKEAMFHLNRRSQPAFLYSDVHFLLLGFILERIFNQDLD 
VIUCDQVWKPWGN^ETKFGPVELAVPTVRGVEAGrVHDPtu\RLLGRHAGSAGLFSTUCDLQIFLEHYLADDFARDLNQ 
NFSPLDDKERSLAWNLECDWLDHTGYTGTFIMWNRQKQEATIFLSNRTYEKDERAQWILDRNQVMNURKEEZ 

15 

MMKKTYNHILVWGVIFYSICIVCFCFTPQEQSTVGVGTPG^QHLGRLVFLLTPFNSLWKLGEVSDIGQLCWlFLQNIL^Ar 
FLFFPUFQLLYLFPNLRKTKKVLLFSFLVSLGIECTQLILDFFFDFNRVF£IDDL^^^r^LGGYLAWLLYKRLHK^rKVRN 
Z 

20 

MKIPLLTFARHKFVYVLLTLLFLALVYRDVLNrrYFFFDIHAPDLAKFDGQAlKNDLLKSALDFRILQFNLGFYQSFriPIU 
VLLGFQYIELKNKVLRLSIGREVSYQGLKRKLTLQVASIPCLrYLVTVLnAriTYFFGTFSPLGWNSLFSDGSGLQRLLDGE 
DCSYLFFTCVmCIFlNAlYFLQIVDYVGNVTRSAITYLMFLWLGSMLLYSALPYYMVPMTSLMQASYGDVSLMKLFTP 
YILYrVPYMVLEKYEDNVZ 

25 

MFKVLQKVGKAFMLPIAlLPAAGLLLGIGGALSNPTTlATYPILDNSIFQSIFQVMSSAGEVVFSNLSLLLCVGLaGLAKR 
DKGTAAUVGVTGYLVMTATIKALVKLFMAEGSAIDTGVIGALVVGIVAVYLHNRYNNIQLPSALGFFGGSRFVPIVTSF 
SSILIGFVFFVIWPPFQQLLVSTGGYlSQAGPIGTFLYGFLMRLSGAVGLHHnYPMFWYTELGGVETVAGQTVVGAQKIF 
30 FAQLADLAHSGLFTEGTRFFAGRFSTMMFGLPAACLAMYHSVPKNRRKKYAGLFFGVALTSFrrCrrEPIEFMFLFVSPV 
LYVVHAFLDGVSFFlADVLNISIGNTFSGGVlDFTLFGILQGNAKTNWVLQIPFGLrWSVLyYIIFRWFrTQFNVLTPGRGE 
EVDSICEISESADSTSNTADYLKQDSLQIIRALGGSNNIEDVDACVTRLRVAVKEVNQVDKALLKQIGAVDVLEVKGGIQ 
AIYGAKAILYKNSINEILGVDDZ 

35 MKFRKLACTVLAGAAVLGlJ^.ACGNSGGSKDAAKSGGDGAKTErrWWAFPVFTQEKTGDGVGTYEKSIIEAFE^ 

KVKLETIDFKSGPEKITTAlEAGTAPDVLFDAPGRJIQYGKNGKLAELNDLFTDEFVKDVNNENrVQASICAGDICAYMYPJ 
SSAPFYMAMNKKMLEDAGVANLVKEGWTTDDFEKVLKALKDKGYTPGSLFSSGQGGDQGTRAFISNLYSGSVTDEKV 
SKYTTDDPKFVKGLEKATSWIKDNLINNGSQFDGGADIQNFANGQTSYTILWAPAQNGIQAKLLEASKVEVVEVPFPSD 
EGKPALEYLVNGFAVFNNKDDICKVAASICKFIQFlADDKEWGPKDVVRTGAFPVRTSFGFa.YEDiCRMETISGWTQYYSP 

40 YYNTIDGFAEMRTLWFPMLQSVSNGDEKPADALKAFTEKANETIKICAMKQZ 



MQSTEKKPLTAFTVISTIILLLLTVLFIFPFYWILTGAFKSQPDTIVIPPQWFPKMPTMENFQQLMVQNPALQWMWNSVFI 
SLVTMFLVCATSSLAGYVLAKKRFYGQRILFAIFIAAMALPKQVVLVPLVRXVNFMGIHDTLWAVILPLIGWPFGVFLM 
45 KQFSENlPTELLESAKIDGCGEIRTFWSVAFPIVKPGFAALAIFTFINrrWNDYFMQLVMLTSRNNLTrSLGVATMQAEMA 
TNYGLIMAGAALAAVPIVTVFLVFQKSFTQGITMGAVKGZ 



MKIMFKNFNNILLNRKrVLLLRIVLMMILINHLLSTAVQKQDAVIFFKRELISIFSYNDYSEANLEIPKLLLNLSLFMVGW 
50 LSVILLESDLADHYHHURYQSSSFFDYTRKRLVVlSKFFTQDLFVWFLGLLPLGlHFKTVALFFLi^QLMMLYLLLSYU 
ALISAGAGFSFFLYFLAFVGQEWMMDHIVTVYLVLLSLLVMUVSRLEEKFKKGZ 



MGKGEMGKGVIGLEFDSEVLVNKAPTLQLANGKTATFLTQYbSKTLi-FAVDKEDIGOEIIGlAKGSIESMHhU.PVNLA^ 
55 ARVPGGVNGSKAAVHEVPEFTGGVNGTEPAVHElAEYKGSDSLVTLTTKKDrrYKAPLAQQALPETGNKESDLLASLG 
LTAFFLGLFTLGFCKREQZ 



60 MKKTFFLLVLGLFCLLPLSVFAIDFKINSYQGDLYIHADNTAEFRQKIVYQFEEDFKGQIVGLGRAGKMPSGFDIDPHPKI 
QAAKNGAELADVTSEVTEEADGYTVRVYNPGQEGDIVEVDLVWNLKNLLFLYDDIAELNWQPLTDSSESIEKFEFHVR 
GDKGAEKLFFHTGKLFREGTIEKSNLDYTIRLDNLPAKRGVELHAYWPRTDFASARDQGLKGNRLEEFNKIEDSrVREK 
DQSKQLVWVLPSIL^ISLLLSVCFYFIYRRKTTPSVKYAKNHRLYEPPMELEPMVLSEAVYSTSLEEVSPLVKGAGKFTF 
DQLIQATLLDVIDRGNVSUSEGDAVGLRLVKEDGLSSFEKDCLNLAFSGKKEETLSNLFADYKVSDSLYRRAKVSDEKR 

65 IQARGLQLKSSFEEVLNQMQEGVRKRVSFWGLPDYYRPLTCGEKALQVGMGALTILPLFIGFGLFLYSLDVHGYLYLPL 
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PILGFLGLVL^VFYYWKLRLDNRDGVLNEAGAEVYYLWTSFENMLREURLDQAELESrVVWNRlXVYATU^GYADK 
VSHLMKVHQIQVENPDINLYVAyGWHSTFYHSTAQMSHYASVANTASTYSVSSGSGSSGGGFSGGGGGCSIGAFZ 

5 MKKVRKIFQKAVAGLCCISOLTAFSSrVALAETPETSPAIGKVVIlCETGEGGALLGDAVFELKN^^^DGTTVSQR 

EAlFSNrKPGTYTLTEAQPPVGYKPSTKQUTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTPYQtlKVDGS 
EKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGlELTVSGiCTVYEQKDKSVPLDVVILLDNSNSMSNIRNKNA 
RI^ERAGEATRSUDKITSDSENRVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYIXJTSFrrrmCDYSYLKL 
TNDKNDIVELKNKVPTEAEDHDGNRLMYQFGATFTQKAUvlKADEILTQQARQNSQKVIFHn'DGVPTMSYPINFNHAT 

10 FAPSYQNQLNAFFSKSPNKDGIIXSDFITQATSGEHTIVRGDGQSYOMFTDFCTVYEKGAPAAFPVKPEKYSEMKAAGYA 
VlGDPINGGYrWLNWRESILAYPFNSNTAKITNHGDPTRWYYNGNlAPDGYDVFTVGIGINGDPGTDEATATSFMQSISS 
KPENYT^^^aDTTKILEQLNRYFHTIVTEKKSIENGT^'DPMGELIDLQLGTDGRFDPADYTLTANDGSRLENGQAVCG 
QNDGGLIJCNAKVLYDT^EKRIRVTGLYLGTDEKVTLTY^A^RLNDEFV^SNKFYDTNGRTTLHPKEVEQ^^rVRDFPIPKI 
RDVRKYPEITISKEICKLGDIEFIKVNKNDKKPLRGAVFSLQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDG 

15 KYRLFENSEPAGYKPVQNICPIVAFQIVNGEVRDVTSIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYUC 
CMMMGGVLLYTRKHPZ 
\ 

MKSINKFLTMLAALLLTASSLFSAATVFAAGTTTTSVTVHKLI^TDGDMDKIANELETGNYAGNKVGVLPANAKEU^ 
20 VMFVWTNrrNNEIIDENGQTLGVNIDPQTFKL3GAMPATAMiaaTEA£GAKFNTANU>AAKYKIYEIHSI^ 

TLTGSKAVPIEIELPLNDVVDAHVYPKNTEAKPKIDKDFKGKANPDTPRVDKDTPVNHQVGDVVEYEIVTKIPALANYA 
TANWSDRMTEGLAFNKGTVKVTVDDVALEAGDYALTEVATGFDLKLTDAGLAKVNDQNAEKTVKITYSATLNDKAI 
VEVPESNDVTFNYGNNPDHGhrrPKPNKPNENGDLTLTKTWVDATGAPIPAGAEATFDLVNAQTGKVVQTVTLTTDFCN 
TVTVNGLDKNTEYKFVERSIKGYSADYQEITTAGEIAVKNWICDENPKPLDPTEPICVVTYGKKFVKVNDKDNRLAGAEF 
25 VIANADNAGQYLARKADKVSQEEKQLVVTTKDALDRAVAAYNALTAQQQTQQEKEKVDKAQAAYNAAVIAANNAF 
EWVADKDNENVVKLVSDAQGRFEITGLLAGTYYLEETKQPAGYALLTSRQKFEVTATSYSATGQGIEYTAGSCKDDAT 
KVVNKKmPQTCGIGTlIFAVAGAAIMGIAVYAYVKNNKDEDQLAZ 

30 MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV 
QrVRDLHSWDENKLSSFKKTSFEKn-FLENQlEVSHIPNGLYYVRSaQTDAVSYPAEFLFENrrDQTVEPLVrVAKKTDTM 
TTKVKL!KVDQDHNRI.EGVGFKLVSVARDVSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGNYRFf:EVEPL 
AGYAVTTLDTDVQLVDHQLVrrrvVNQKLPRGI^DFMKVDGRTNrrSLQGAMFKVMKEESGHYTPVLQNGKEVVVTS 
GKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGEETLVYLDACCHFV 

35 VWZ 

MSHIYLSIFTSLLLMLGLVNVAQADEYLRIGMEAAYAPFNWTQDDDSNGAVKIDGTNQYANGYDVQUKKIAKDLGKE 
PLWKTKWEGLVPALTSGKIDMIlAGMSPTAERKQElAFSSSYYTSEPVLLVKKDSAYASAKSLDDFNGAKrTSQQGVYL 
40 YNHAQIPGAKKETAMGDFAQMRQALEAGVIDAYVSERPEALTAEAANSKFKMIQVEPGFKTGEEDTAIAIGLRKNDNR 
ISQINASIETISKDDQVALMDRMIKEQPAEATTTEETSSSFFSQVAKILSENWCX^LLRGAGrrLLISrVGTnGUIGl^rc^ 
TAPLSE^^CVIYGLQKLVGWVLNVYIE^FRGTPMIVQSMVIYYGTAOAFGINLDRTLAArFIVSI^^^GAYMTEIVRGGILAV 
DKGQFEAATALGMTHNQTMRKrVLPQVVRNILPATGNEFVlNIKDTSVLNVlSVVELYFSGNTVATQTYQYFQTFTllAV 
rYFVLTFTVTRILRFIERRMDMDTYTTGANQMQTEDLKZ 

45 

MTQAILEIKHLKKSYGQNEVLKDISLTVHKGEVISIIGSSGSGKSTFLRSINLLETPTDGQILYHGQNVLEKGYDLTQYREK 
LGMVFQSFNLFENLNVLE^arVAQTTVLKRJERTEAEK:IAK£NLEKVGMGERYWQAKPKQLSGGQKQRVAlARALSMN 
PDAILFDEPTSALDPEMVGEVLKIMQDL^QEGLTMIVVTHEMEFARDVSHRVIFMDKGVIAEEGKPEDLFTNPKEDRTK 
50 EFLQRYLK2 

MKKYQLLFKISAVFSYLFFVFSLSQLTLIVQNYWQFSSQIGNLFWIQNIL^LLFIGVMrvVLVKTGHGYLFRlPRKKWLW 
YSILTVLVLVFQISFNVQTAKHVQSTAEGWAVLIGYSGTNFAELGIYIALFFLVPLMEELIYRGLLQHAFFKHSRFGLDLL 
DD LPSILFALPHFSSLPSLLDIFVFATVGIIFAGLTRYTICSIYPSYAVHVINNIVATFPFLLTFLHRVLGZ 



60 



MNKKQWLGLGLVAVAAVGLAACGNRSSRNAASSSDVKTKAAIVTDTGGVDDKSFNQSAWEGLQAWGKEHNLSKDN 
GFTYFQSTSEADYANNLQQAAGSYNLIFGVGFALNNAVKDAAKEHTDLNYVLIDDVUCDQKNVASVTFADNESGYLA 
GVAAAKTTKTKQVGFVGGIESEVISRFEAGFKAGVASVDPSIKVQVDYAGSFGDAAKGKTIAAAQYAAGADIVYQVAG 
GTGAGVFAEAKSLNESRPENEKVWVIGVDRDQEAEGKYTSKDGKESNFVLVSTUCQVGTTVKDISNKAERGEFPGGQV 
IVYSLKDKGVDLAVTNLSEEGKKAVEDAKAKILDGSVKVPEKZ 
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MSKiCLQQISVPUSVFLGILLGArVMWIFGYDAIWGYEELFYTAFGSLRGIGEIFRAMGPLVLIGLGFAVASRAGFFrA^G 
PGQAl^CWlLSGWFALSHPDMPRPLMll^TIVIALlAGGrVCAlPGILRAYLGTSEVIVTlMMNYrVLYVGNAnHAFP 
FMQSTDSTIRVGANATYQTPWI^ELTGNSRMNIGIFFAIIAVAVIWFMLKICTTLGFEIRAVGLNPHASEYAGISA^ 
SMnSGALAGLGGAVEGLGTFQhfVYVQGSSLAlGFNGMAVSU.AANSPIGILFAAFLFGVLQVGAPGMNAAQVPSELVSI 
5 VTASIIFFVSVHYLIERFVKPKKQVKGGKZ 



MGVKKKLKLTSLLCLSLLIMTACATNGVTSDITAESADFWSKLVYFFAEIIRFLSFDISIGVGIILFTVURTVLLPVFQVQ 
MVASRKMQEAQPRIKALREQYPGRDMESRTKLEQEMRXVFKEMGVRQSDSLWPILIQMPVIlJ^LFQALSRVDFLKTGH 
10 FLWINLGSVDTTLVLPILAAVFTFL5TWL^NKALSERNGATTAMMYGIPVLIFIFAVYAPGGVALYWT^ 
FLNNPFKI1AEREAVVQAQKDLENRKRKAKKKAQKTK2 



MVIDPFAINELDYYLVSHFHSDHIDPYTAAAILNNPKLEHVKFIGPYHCGRIWEGWGVPKERIIVVKPGDTIELKDMKIH 
15 AVESFDRTCLVTLPVNGADETGGELAGLAVTDEEMAQKAVNYIFETPCCTIYHGADSHFSNYFAKHGFODFKrDVALNN 
YGENPVGIQDKMTSIDLLRMAENLRTKVIIPVHYDIWSNFMASTNEILELWKMRKDRLQYDFHPnWEVGGICYTYPQD 
QHLVEYHHPRGFDDCFEQDSNIQFKALL2 



20 MFl^CWLSSFA^^mHDLLGVLFPDSPFLNAFESA^AAPLVEEPLKLLSI:VFVi^^,^PVRKLKSLFLl-G^^ 

GYIRTDLPEGFDFTISRILERIISGIASHWTFSGLAVVGVYLLYRAYKGQKVGICKQGLIFLGLALGTHFLFNSPFVELETEL 
PLAIPVVTA1ALYGFYHAYCFVEKHNEL^4T2 



25 MKVEPRCDVL^RMSHFFfRJLIMELQELVERSWArRQAYHELEVKHHDSKWTVEEDLLALSNDIGNFQRLVMTKQGRY 
YDETPYTLEQKLSENIWWLLELSQRLDIDILTEMENFLSDKEKQLNVRTWKZ 



MLDWKQFFLAYLRSRSRLFIYLLSLAFLVLLFQFLFASLGIYFLYFFFLCCFVriLFFTWDILVETQVYRQELLYGEREAK 
30 SPLElALAEKLEAREMELYQQRSKAERKLTDLLDYYTLWVHQIKTPlAASQLLVAEVVDRQLKQQLEQErFKIDSYTNLV 
LQYLRLESFHDDLVLKQVQIEDLVKEIIRKYALFFIQKGLNVNLHDLDKEIVTDKKWLLVVIEQnSNSLKYTKEGGLEIY 
MDDQELCrKDTGIGIKNSDVLRVFERGFSGYNGRLTQQSSGLGLYLSKKISEELGHQIRIESEVGKGTTVRIQFAQVNLVL 
EZ 

35 

MEL^^"HNAEILLSAANKSHYPQDELPEIALAGRSNVCKSSFI^^■MLNRKNLARTSGKPGKTQLLNFFNrDDKMRFVDVP 
GYGYARVSKKEREKWCCMIEEYLTTRENLRAVVSLVDLRHDPSADDVQMYEFLKYYEIPVIIVATKADiaPRCKWNKH 
ESAIKICKLNFDPSDDFILFSSVSKAGMDEAWDAILEKLZ 

40 

mtkkqlhlvrvtgmsgagio^aiqsfedlgyftldnmppallpkflqlveikednpklalwdmrsrsffs 
enqdgldfkilfldaadkelvaryketrrshplaadgrlldgiklerellaplknmsqnrvvdtteltprelrfrrlaeqf 
sdqeqaqsfrievmsfgfkygipidadlvfdvrflpnpyylpelrnqtgvdepvydyvmnhpesedfyqhllaliepilp 
syqkegksvltiamgctggqhrsvafakrjlaqdlsknwsvneghrdkdrrketvnrs2 

45 



mrkpkitvigggtgspvilkslrekdveiaaivtvaddggssgelrknmqqltppgdlrnvlvamsdmpkfyekvfqyr 
fsedagafaghplgnlilagl^emqgstynamqllskffhttgkiypssdhpltlhavfqekrrevageshivdhrgiidn 

VYVTNALNDDTPLASRRVVQTILESDM^V^LGPGSLFTSILPNIV1KEIGRALLETKAEIAYVCNI^^QRGETEHF^DSDHV 
EVLHRHLGRPFIDTVLVNIEKVPQEYMNSNRFDEYLVQVEHDFVGLCKQVSRVISSNFURLENGGAFHDGDLIVDELMR 

IQVKKZ 



55 MKNLIKLLIIRLIVNLADSVFYrVALWHVSNNYSSSMFLGIFlAVNYLPDLLLIFFGPVIDRVNPQKIUISILVQLAVAVIFL 
LLLNQISFWVIMSLVFISVMASSISYVIEDVLIPQVVEYDKIVFANSLFSISYKVLDSIFNSFASFLQVAVGFILLVKIDIGIFL 
LALFILLLLKFRTSNANIENFSFKYYKREVLQCTKFILNNKLLFKTSISLTUNFFYSFQTVVVPIFSIRYFDGPIFYGIFLTIA 
GLGGILGNMLAPIVIKYLKSNQIVGVFLFLNGSSWLVAIVIKDYTLSLILFFVCFMSKGVFNIIFNSLYQQIPPHQLLGRVN 
TTIDSIISFCMPIGSLVAGTLIDLNIELVLIAISIPYFLFSYIFYTDNGLKEFSIYZ 



MMSNKNKEILIFAlLYTVLFMFDCVICUJVSLMPSAlANYLVYVVLALYGSFLFKDRLIQQWKEIRiCTKRKFF^^^ 

LFLILMTVVFEFVSEMLKQFVGLDGQGLNQSNIQSTFQEQPLLIAVFACVIGPLVEELFFRQVLLHYLQERLSGLLSIILV 

GLVFALTHMHSLALSEWIGAVGYLCGGLAFSnYVKEKENlYYPLLVHMLSNSLSLIILAlSIVKZ 



% % 

87 



UCKPireFKNVSKVFEDSNTKVUCDINFElXEGKFVTU-GASGSGKSmNUAGLLDATTGDI^ 
TWQSYAU^PHMNTVFENVAFPLIUJUaDKKEIEORVAEVLKMVQLEGYEKRSIRKLSCCQRQRVAlARAlIN 
EPLSALDLKLRTDMQYELRELQORLGITFVFVTHDQEEALAMSDWIFVMNDGEIVQSGTPVDIYDEPINHFVATnGESN 
5 ILPGTMIEDYLVEFNGICRFEAVDGGMKPNEPVEVVIRPEDLRITU>EEGKLQVKVDTQLFRGVHYEI1AYDELGNEWM1 
HSTRKAIVGEEIGLDFEPEDIHIMRLNETEEEFDARIEEYVEIEEOEAGUNAIEEERDEENKLZ 



MKSMRILFLL^LIQlSL^SCFLWKECILSFKQSTAFnCSMVFVSGICAGVNYLYTRKQEVHSVl^iCKSVKLFYSM 
10 LLGAVLVLSDNLHKhm-QQELVDFLLPSFFFLFGLDLLIFLPUaCYVRDFLAMLDRKFCTVLN^ 
LIYIGLGLFFAAYLVPNSVKKEVSFYGHIFRDLVLVIVTUFF2 



MVKKriGMVI^L^VTVVGVGVFAYTIYQQGTETUUCTYKKIGEETKVIEATEPLTlLLMGVDTGNVERTF 
15 MILMTVNPKTKJOTMMSLERDILTRrESGNGQAHEAKLNSAYADGGAELAlETIQKMMNIHIDRYVMVNMR^ 

DAVGGITVNNlLGFPISISDQEEFNmSIGVGEQHIGGEEALVYARMRYQDPEGDYGRQKRQREVIQKVMEKALSLNSIGH 
YQEILKAL^DNMQTNIDLSAKSIPNLLGYKDSFKTIETQQtQGEGEILQGVSYQrVSRAHMLEMQNLLRRSLGQEEVTQL 
ETNAVLFEDLFGRAPVGDEDNZ 

20 

MKKQAYVlULTSFLFVFFFSHSLLEILDFDWSIFLHDVEKTEKFVFLLLVFSMS\rrCLl^LFWRGrEELSLRKMQANLK 
RLLAGQE\A^QVADPDLDASFKSLSGFCLNLLTEALQKAENQSLAQEEEIIEKERXR1ARDLHDTVSQELFAAHMILSGISQ 
QALKJ^DREKMQTQLQSVTAILETAQKDLRVIXLHLRPVELEQKSUEGIQILLKELEDKSDLRVSLKQNMTCLPKKIEEHI 
FRILQELISNTLRHAQASCLDVYLYQTDVELQLKVVDNGIGFQLGSLDDLSYGLRNIXERVEDMAGTVQLLTAPKQGLA 
25 VDIRIPLLDKEZ 



MrVSIlSQGFVWAlLGLCIFMTFRILNFPDNm'EGSFPLGGAVAVTLrrKGVNPFLATLVAVGAGCLAGMAAGLLYTKGK 
IPTLLSGILVMTSCHSIMLLIMGRANLGLLGTKQIQDVLPFDSDLNQLLTGUFVSIVIALMLFFLDTKLGQAYIATGDNP 
30 DMARSFGIHTGRMELMGLVL^NGVIAI^CALIAQQECYADVSRGIGVIVVGLASUIGEVIFKSLSLAERLVTIVVGSIAY 
QFLVWAVIALGFNTSYLRLYSALILAVCLMIPTFKQTILKGAKLSKZ 



MKKMKVWSTVl^TGVALTTLAACSGGSNSTTASSSEEKADKSQELVIYSNSVSNGRGDWLTAKAKEAGFNK 
35 AQLADRVIAEKNNAVADMVFGIGAVDSNKIRDQKLLVQYKPKWLDKIDQSLSDJCDNYYNPVIVQPLVLIGAPDVKEMP 
KDWTELGSKYKGKYSISGLQGGTGRAILASILVRYLDDKGELGVSEKGWEVAKEYLKNAYTLQKCESSIVKMLDKEDPI 
QYGMMWGSGALVGQKEQNVVFKVMTPEIGVPFVTEQTMVLSTSICKQALAKEFIDWFGQSEIQVEYSKNFGSIPANICD 
ALKDLPEDTKKFVDQVKPQNIDWEAVGKHLDEWVEKAELEYVQZ 

40 

MIKFDNIQIKYGDFVAIDNLNLDIHEGEFFTFLGPSGCGKSTTLRALVGFLDPSSGSIEVNGTDVTHLEPEKRGIGIVFQSY 
ALFPTMTVFDNlAFGLKVKKVAPDVrKAKVSAVAAKIKISDQQLQRNVSELSCGQQQRVALARALVLEPKILCLDEPLS 
NLDAKLRVDLRKELKRLQKELGITTLYVTHDQEEALTLSDRIAVFNNGYIEQVGTPVEIYHNSQTEFVCDFIGDINVLTD 
ETVHEVLLKNTSVFLEDKKGYtRLEKVRFNRETEQDFILKGTIIDVEFSGVTIHYTIKVSESOILNVTSIDSQAAlRSVGESV 
45 ELFITPSDVLQFZ 



MRHKLNLKDWLIRLGLrWFLVrniYPNFDLVVNVFVKGGEFSLDAVHRVLKSQRALQSIMNSFKLAFSLIITVNVVGIL 

CVLFTEYFDIKGAKILKLGYMTSLIYGGVVLATCYKFVYGPYGLrrKFLQWIPSLDPNWFIGYGAVLFIMTFSGTANHT 

LFLT^^^IRSVDYHTIEAARNMGAKPFTVFRKVVLITLIPTLFALT^MVFL^GL3AVAAPMIVGGKEFQTINPMr^^F 

NSRD^^ALlJV^LGIATTlLLTIM^aCIEKGGNYISlSKTKAPL^aCQKlASKPWNnAHIVAYGLF^VFMUPUFIV 

AIQTGNLTLSNFTLENYRLFFSNSAAFSPFLVSFIYSIIAATTATILAVVFARVVRKHKSRFDFLFEYGALLPWLLPSTLLA 

VSLLFTFNQPQFLVLNQILVGSLVILLlAYIWKIPFSYRMVRAlLFSVDDEMEDAARSMGASPFYTMMKVnPFILPVVLS 

VlALNFNSLLTDFDLSVFLYHPLAQPLCmRSAGDETATSNAQALVFVYTIVLMIISGTVLYFTQRPGRKVRKZ 
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Table 3 

ID201 - 4106.4 




20 



ATGATAAAAAATCCTAAATTATTAACCAAGTCTTTTTTAAGAAGTTTTGCAATTCTATC 

TCATATAGCTATTTATTTGACCTTTC Ci ' i i"l 1 ATTATATTCAACTGGAGGGGGAAAAGTTTAi^^^GAGCGCAAGAG 

TGTTTACGGAGTATTTAAAGACTAAGACATCTGATGAAATTCCAAGCTTACTCCAGTCTT^f^^ 

ATATCTGCTCACCTTAAAAGAGATAI'l' G TAGATAAGCGGCTCCCTCTTGTGCATGACT?^^ 

GCTATCAAATTATATCGTGATGTTAGATATGTCTGTTAGTACAGCAGATGGTAAAC^dTAACCGTGCAAri 

ACGGGGTGGATGTCTACAAAGAAGCAAAGAATATTTTGCTTTTGTATCTCCCATA^^ 

TTTTCCTTTGTTTTTTCTTATTTTTATACTAAACGCTTGCTCAATC 

AATGCAAGATTTGGATGACAATATTCGTTTTGATGAAAGTAGGAAAGATGA^ 

ATGGTATGTATGAGCACTTGTTGAAGGTTATTTATGAGTTGGAAAGTCGT^^ 

CAAAAGGTTTCCTTTGTCCGCGGAGCATCACATGAGTTGAAAACCCCTCTAGCCAGTCT^ 

TATGCAGCATAATATTGGAGATTACAAAGATCATCCAAAATATATI^d&AAAGAGTATAAATA^ 

GCCACTTATTAGAAGAAGTACTGGAGTCTTCTAAATTCCAAGACTCKSACAGACT 

CCAGTTTTAGTAGATATTTTATCACGTTATCAAGAATTAGCTJ^TTCAATAGGTGTTACAAT^ 

agatgctaccagggtcgtcatgagtcttagggcattggataaggttttgacaaacctgat^ 
attcagataaaaatgggcgtgtaatgatatccgagcaagXtggctatctctctatca;^^ 
agtgaccaagaactagaacatttatttgatatattct^fgattctcaaatcgtga^ 
tttgggtctttacattgtgaataatattttagaaaqctatcaaatggattatag^ 
tggaatttaagattagcttgtag 



25 MI KNPKLLTKS FLRS FAI LGGVGLVIHI AIYt-^FP FYY I QLEGE KFNES ARVFTE YLKTKTSDE I PSLLQSYSKSLT 

ISAHLKRDXVDKRLPLVHDUDIKDGKI-SNYTyMIJDMSVSTATC 

FSFVFSYFYTKRLLNPLFYISEVTSKMOp/LpDNIRFDESRKDEVGEVGKQINGMYEHLLKVIYELE^ 
0KVSFVRGASHELKTPIJ^LRIILENMQ™IGDYIa3HPKYIAKSINKIDQ^^HLLEEV^ 

PVLVDI LSRYQELAHS I GVTI ENQLTDATR WMSLRALDKVLTNL I SNAI KYSDKNGRVI I SEQDGYLS I KNTCAPL 
30 SDQELEHLFDIFYKSQIVTDKDESSGLGLYIVNNILESY0r4DYSFLPYEHGMEFKISL2 



ID2 0 2 



4106. 9 



35 



40 



45 



50 



55 



ATGGATAAAATTATTAAAACTATATCAGAAAGCGGAGCCTTTCGTGCTTTTGTCCTTGATAGCACTGAAACCGTCCG 

cactgctct^gaaaaacatcaa^cccaagctagctcaactgtagcgcttggtcg^ 

tcgcagccaatgaaaaaggaa^taccaaacttacagttaaggtgttgggatctagctctctaggtgctattatcacc 
gtcgctgataccaaggggaacgtcaaaggctatgttcaaaatcctggtgttgacatcaaa;^ 
agtcctagtcggaccttttgjtggaaatggtcaattcctcgttatcacagactacggt^ 
ctataactcccctcatctqtcgagaaatcggtgaagaccttgccttttaccttactgaaagccaa 
gcggtcggccrcaatgtc^gtt'ttggacgaggaagacaaggtcaaggttgcaggtggi^ 
aggagccaagaaagaagp^ttgctcgctttgaaaaacgcatccaagaaatgccagct 
gcgacgaccatatcgaagccctcctcaaggctatctacggggacgaagcctacaagcgtcttt 
cgtttccaatgtgactgyragccatgaacgctttatgaacgctcttgccagccttccaagctcagacttac^ 
gaaagaggaagaccacggggcagaaatcacttgtcaattctgccaaactacttacaactttgatgaaaagga^ 
aggaactcattcgtgacaaatcttaa 



./I 



MDKIIKTISESGAFRAFVIJDSTETVRTA0EKH0T0ASSTVALGRTL.IASQI1J^ANEKGNTK^ 
VADTKGNVKGYyQNPGVDIKKTATGEVLVGPFVGNGQFLVITDYGTGNPYNSITPLISGEIGEDLAFYLTESOQTPS 
AVGLNVLLDEEDKVKVAGGFLVQVLPGAKKEEIARFEKJilQEMPAISTLLESDDHIEALLKAIYGDEAYK^^ 
RFQCDCSHERFMNAIuASLPSSDLOEMKEEDHGAEITCQFCOTTYNFDEKDLEELIRDKSZ 



ID2 0 3 




ATGAAATCT^TAACTAAAAAGArTAAAGCAACTCTTGCAGGAGTAGCTGCCTTGTTTGCAGTAT^ 
TGTAT^TGCTCAAGAATCATCAACTTACACTGTTAAAGAAGGTGATACACTTTCAGAAATC 
CAAC^TTGAAAAATTGGCAGAAAACAACCACATTGATAACATTCATTTGATTTATGTTGATC^ 
GATGGCCCTGTAGCGCCTGTTGCAACACCAGCGCCAGCTACTTATGCGGCACCAGCCGCrCAAGATGAAACTGTTTC 



V 
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AGCTCCAGTAGCAGAAACTCCAGTAGTAAGTGAAACAGTTGTTTCAACTGTAAGCGGATCTGAAGCAGA^ 
AATGGATCGCTCAAAAAGAATCAGGTGGTAGTATACAGCTACAAATGGACGTTATATCGGACGTTACCAATTAA 



5 MiCSITKKIKATI-AGVAALFAVFAPS^VSAOESSTYTVKEGDTLS£IAETH^rITVEKIJ^ 

DG PVAP VATP APATYAAPAAQDETVS AP VAETPWS ETWSTVSGS EAEAKEW I AQKESGGS I QLQMDVI SDVTN2 

ID204 - 4117.1 

1 0 ATGAATTTAGGAGAATTTTGGTACAATAAAATAAATAAGAACAGAGGAAGAAGGTTAATGA^ 

' XTTT ' l ' lA GCTCTGCTA l ' i ' l ' l I ' L ' l I 'AGCTAGTCCAGAGGGTGCAATGGCTAGTGATGGTACTTGGCAAGGAAAACAGT 
ATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGTTTTTGATACTCATTATCAATCTTGGT^ 
GCAGATGCTAACTATGCTGAAAATGAATGGCTAAAGCAAGGTGACGACTAI I I i ' l 'ACCTCAAATCTGGTGGCTATAT 
GGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCTTTTATTATCITGACCAAGATGGAAAGATGAAAAGAAATGC^ 

15 gggtaggaacttcctatgttggtgcaacaggtgccaaagtaatagaagactgggtctatgattctcaatacgatgct 
tggttttatatcaaagcagatggacagcacgcagagaaagaatggctccaaattaaagggaaggactattat^ 
atccggtggttatctactgaca^gtcagtggattaatcaagcttatgtgaatgctagtggtgccaaagtacagc;^ 
gttggctttttgacaaacaataccaatcttggttttacatcaaagaaaatggaaactatg 
ttcgagaatggtcactattattatctaaaatccggtggctacatggcagccaatgaatggatttgggataagg;^ 

20 ttggttttatctcaaatttgatgggaaaatggctgaaaaagaatgggtctacgattctcatagtcaa^ 

acttcaaatccggtggttacatgacagccaatgaatggatttgggataaggaatcttggttttatctca^ 
gggaaaatagctgaaaaagaatgggtctacgattctcatagtcaagcttggtactacttcaaatccgg^ 
gacagccaa.tgaatggatttgggataaggaatcttggttttacctcaaatctgatgggaaaa^^ 
gggtctacgattctcatagtcaagcttggtactacttcaaatctggtggctacatggcgaaaaatgagac^^ 

25 ggttatcagcttggaagcgatggtaaatggcttggaggaaaaactacaaatgaaaatgctgcttactatc^ 
gcctgttacagccaatgtttatgattcagatggtgaaaagctttcctatatatcgcaaggtagtgtc^ 
ataaggatagaaaaagtgatgacaagcgcttggctattactatttctggtrrgtcaggctatatgaaaacagaac^ 
ttacaagcgctagatgctagtaaggacrt t atcccttattatgagagtgatggccacc g tttttatcactatgtggc 
tcagaatgctagtatcccagtagcttctcatctttctgatatggaagtaggcaagaaatattattcggcagatggcc 

30 tgcattttgatggttttaagcttgagaatcccttccttttcaaagatttaacagaggct^ 

gaattggataaggtatttagtttgctaaacattaacaatagccttttggagaacaagggcgcta l" 
cgaagaacattaccatatcaatgctctttatctccttgcccatagtgccctagaaagtaactggggaagaagtaaaa 
ttgccaaagataagaataatttctttggcattacagcctatgatacgaccccttac c tttctgctaagacatt^ 
gatgtggataagggaattttaggtgcaaccaagtggattaaggaaaattatatcgataggggaagaactttccttgg 

35 aaacaaggcttctggtatgaatgtggaatatgcttcagacccttattggggcgaaaaaattgctagtgtc 
aaatcaatgagaagctaggtggcaaagattag 



MNLGE^WYNKIMKNRGRR1J^KKVRFIFLALLFFI^PEGA^^DGTWQGKQYLKEDGSQAANEW 
40 ADANYAENEWLKQGDDYF YLKSGGYMAKS E WEDKGAFY YLDQDGKMKRNAWVGTS YVGATGAKVI EDWVYDSQ YDA 

WFYI KADGOHAEKEWLO I KGKDY YFKSGGYLLTSQW INQAYWASGAKVQOGWLFDKQYQSWFYI KENGNYADK^ 
FENGHYYYLKSGGYMAANEWIWDKESWFYLKFDGKMAEKEWYDSHSQAWYYFKSGGYMTANEWIWDKESWFTLI^ 
GKIAEKEVAriT)SHS0AWYYFKSGGYMTANEWIWDK£SWFYLKSDGKIAEKXWVYDSHS0AWYYFK5GG 
GYOU3SDGKWU3GKTTNENAAYYOWPVTANVYDSDGEKLSYISOGSVWLDKDRKSDDKRIAITISGLSGYMCT 
45 LOALDASKDFIPYYESDGHRFYHYVAQNASIPVASHLSDMEVGKKYYSADGLHFDGFKLENPFLFKDL.TEATNYSAE 
ELDKVFSLU^I^n^SLLE^^KGATFKEAEEHYHINAI-YI*LAHSALESNWGRSKIAKIDKNNFFGIT^ 
DVDKGILGATKWIKENYIDRGRTFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKDZ 

ID20S - 4118.1 

50 ATGAAAAAATTAGGTACATTACTCGTTCTCTTTCTTTCTGCAATCATTCTTGTAGCATGTGCTAGCGGAA^ 
TACAACTTCTGGTCAAAAACTAAAAGTTGTTGCTACAAACTCAATCATCGCTGATATTACTAAAAATAT^ 
ACAAAATTGACCTTCATAGTATCGTTCCGATTGGGCAAGACCCACACGAATACGAACCACTTCCTGAAGACGTTAAG 
AAAACTTCTGAGGCTAATTTGATTTTCTATAACGGTATCAACCTTGAAACAGGTGGCAATGCTTGG^^ 
GGTAGAAAATGCCAAGAAAACTGAAAACAAAGACTACTTCGCAGTCAGCGACGGCGTTGATGTTATCTACCTTG 

5 5 GTCAAAATGAAAAAGGAAAAG A^G ACCCACACGCITGGCTTAACCTTGAAAACGGTATTATTT^ 

GCCAAACAATTGAGCGCCAAAGACCCTAACAATAAAGAATTCTATGAAAAAAATCTCAAAGAAT^ 
AGACAAACTTGATAAAGAAAGTAAGGATAAATTTAATAAGATCCCTGCTGAAAAGAAACTCATTGTAACCAGCGAA 
GAGCATTCAAATACTTCTCTAAAGCCTATGGTGTCCCAAGTGCTTACATCTGGGAAATCAATACTGAAG^ 
ACTCCTGAACAAATCAAGACCTTGGTTGAAAAACTTCGCCAAACAAAAGTTCCATCACTCTTTGT^ 

60 GGATGACCGTCCAATGAAAACTGTTTCTCAAGACACAAACATCCCAATCTACGCTCAAATCTTTACTGACT 
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CAGAAa^AGGTAAAGAAGGCGACAGCTACTACAGCATGATGAAATACAACCTTGACAAGATTGCT 
AAATAA 



5 MKKI^TLLVLFLSAIILVACASGKKDTTSGOKLKWATNSIIADITKNIAGDKIDLHSIVPIGQDPHEYEPLPEDVK 
KTS EANL r FYNG I NLETGGNAWFTKLVENAKKTENKD Y FAVS DGVDVI YLEGQNEKGKEDPHAWLNLENG 1 1 FAKNI 
AKOLSAKDP^mKEFYEKNLK2:YTDKlX>KIJ3KESKDKFNKIPAEKKLIVTSEGAFKYFSKA^ 

TPEO I KTLVE KLRQTKVPS LFVESSVDDRPMKWSQDTN I P I YAQ r FTDS I AEQGKEGDS Y YSMMKYNLDKI AEGLA 
KZ 

10 

ID206 - 4119.1 

atggaatggtataaaaaaatcggacttcttgcaactacaggtttagctttgtttgggct 
tggtaaatctgcggatggcacagtgaccatcgagtatttcaacctvgaaaaaagaaatgaccaaaaccttggaag;^^ 

TCACTCGTGATTTTGAGAAGGAAAACCCTAAGATCAAGGTCAAAGTCGTCAATGTACCAAATGCTGGTGAAGTA^ 
15 AAGACACGCGTTCrrCGCAGGAGATGTGCCTGATGTGGTCAATATTTACCCACAGTCCATCGAACTGCAAGAATGOT 

AAAAGCAGGTGTTTTTGAAGATTTGAGCAACAAAGACTACCTGAAACGCGTGAAAAATGGCTACGCTGAAAAAT^ 

CTGTAAACGAAAAAGTTTACAACGTTCCTTTTACAGCTAATGCTTATGGAATTTACTACAAC^^ 
- GAACTGGGCTTGAAGGTTCCTGAAACCTGGGATGAATTTGAAOVGTTAGTCAAAGATATC^ 

ACCATTTGGAATTGCAGGTGCAGATGCTTGGACACTCAATGGTTACAATCAATTAGCCTTTGCGA 
20 GAGGAAAAGAAGCAAATCAATACCTTCGTTATTCTCAACCAAATGCCATTAAATTGTCGGATCCGATTATGAAAGAT 

GATATCAAGGTCATGGACATCCTTCGCATCAATGGATCTAAGCAAAAGAACTGGGAAGGTGCTGGCTATACCGATGT 

TATCGGAGCCriTCGCACGTGGGGATGTCCTCATGACACCAAATGGGTCTTGGGCGATCACAGCGATTAATGAACAAA 

AACCGAACTTTAAGATTGGGACCTTCATGATTCCAGGAAAAGAAAAAGGACAAAGCTT^ 

TTGGCATGGTCTATCTCAGCCACCACCAAACATCCAAAAGAAGCCAATGCCTTTGTGGAATATATG^^ 
25 AGTCATGCAAAAATACTACGATGTGGACGGATCTCCAACAGCGATCGAAGGGGTCAAACAAGCAGGAGAAGAT^ 

CGCTTGCTGGTATGACCGAATATGCCTTTACGGATCGTCACTTGGTCTGGTTGCAACAATACT 

GACTTCCATACCTTGACCATGAACTATGTCrrrGACCGGTGATAAACAAGGCATGGTCAATGATTTGAATGC C TT 

TAACCCGATGAAAGCGGATGTGGATTAG 

30 

MEWYKKIGLLATTGIJU.FGLGACS^nfGKSAIX^VTIEYFNQKKE^r^^CTLEEITRDFEKE^ 
KTRVI^GDVPDVVNIYPOSIELQEWAKAGVFEDLSNKDYLKRVKNGYAEKYAVNEKVTOVPFTANAYGIYYN 
ELGLKVPETWDEFEQLVKDIVAKGOTPFGIAGADAWTUNGYNQLAFATATGGGKEANQYLRYSQPNAIKLSDPIMKD 
D I KVMD I LR I NGS KQKNWEGAG YTDV I G AFARGDVU^T PNGS WA I TA I NEO KPNFKI GT FM I PG KE KGQS LTVG^^ 
35 LAWSISATTKHPKSLANAFVEYMTRPEVMOKYYDVDGSPTAIEGVKQAGEDSPIAGKTEYAFTDRH 
DFHTLT^I^n^V'LTGDKOGMVNDLNAFFNPMKADVDZ 

ID207 - 4123.1 

ATGAAGAAAATCAAACCGCATGGACCGTTACCAAGTCAGACTCAGCTAGCTTATCTGGGAGATGAACTAGC^ 
40 TATCCACT7CGGTCCTAATACCTTTTATGACCAAGAATGGGGGACTGGACAGGAGGATCCTGAGCGCTTTAACCCGA 
GTCAGTTGGATGCGCGTGAGTGGGTTCGTGTGCTCAAGGAAACGGGCTTCAAAAAGTTGATTTTGGTGGTCAAGCA^ 
CACGATGGCTTTGTCCTTTATCCGACAGCTCACACAGATTATTCGGTTAAGGTCAGTCCTTGGAGGAGAGGAAAGGG 
CGACTTGCTCCTTGAAGTATCCCAAGCTGCCACAGAGTTTGATATGGATATGGGGGTCTACCTGTCACCGTGGGATG 
CCCATAGTCCCCTCTATCATGTGGACCGAGAAGCGGACTACAATGCCTATTATCTGGCTCAGTTGAAGGAAATC^ 
45 TCAAATCCTAACTATGGGAATGCTGGTAAGTTCGCTGAGGTTTGGATGGATGGTGCCAGAGGAGAGGGCGCGCAAAA 
GGTTAATTATGAATTTGAAAAATGGTTTGAAACCATTCGTGACCTGCAGGGCGATTGCTTGATTTT^ 
GCACCAGTATCCGCTGGATTGGCAATGAACGAGGGTATGCAGGTGATCCACTGTGGCAAAAGGTGAATCCTGATAAA 
CTAGGAACAGAAGCAGAGCTGAACTATCTTCAGCACGGGGATCCCTCGGGCACGATTTTTTCAATCGGAGAGGCAGA 
TGTTTCCATCCGTCCAGGCTGGTTCTACCATGAGGATCAGGATCCTAAGTCrCTCGAGGAGTTGGTCGAAATCT 
50 TTCACTCAGTAGGGCGAGGAACTCCACrCTTGCTTAATATTCCGCCGAATCAAGCTGGGCTCT^ 
ATTGAACGACTTTATGAATTTGCGACCTATCGCAATGAGCTCTATAAAGAAGATTTGGCrCTGG^ 
TGGTCCAGCTCTTTCCGCAGACTTTGCTTGTCGCCATTTGACAGACGGCCTTGAGACCAGCTCrTGGG^^ 
CAGACTTGCCCATCCAGTTAGAACTCGACTTAGGTTCTCCTAAAACTTTTGATGT^ 

aagctagggcaacgaatcgctgcttttcatgtgcaagtagaggtggatggtgtctggcaggagtitggttcgggt^ 
55 tactgttggrracaaacgtctcttacgagga > 

aggctttgcctttgttgaccaagatttccctttataaaactcctggattatcaaaaaa^ 
gcatttgcagaaaaaagcctagcrgtggcaaagggagaaaatgcctattttacagttaagcgcag^ 

tcctttagaagctaagarrrcgattcaaccggggacaggtgtccatggtgtcgcctatcaggatgagattc;^ , 
ttgcgtttcaaactggtgagactgaaaaaagtctgacgctaccaaccttgtatttcgcaggagata;^ 
60 ttctatctgaacctaacggtggatggtcagcttgtggatcaacttcaagtccaagtttcataa 
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MKKIKPHGPLPSQTQIJVyLGDEIJ^FIHFGPNTFyDOEIWGTGOEDPERFNPSQUDAREV^VLK^ 
HDGFVLYPTAHTDYSVKVS PWRRGKGDLLLEVSQAATEFDMDMGVYLS P WDAHS PLYHVDREAD YNAYYLAQLKE I L 
SNPNYGNAGKFAEVWMDGARGEGAOKVNYEFEKWFETIRDLOGDCLIFSTEGTSIRWIGNERGYAGDPL^^ 
5 LGTEA£L^nfWHGDPSGTIFSIGEADVSIRPGWFYHEDQDPKSLEELVEIYFHSVGRGTPLLU^IPPNQAGLFDAKD 
lERLYEFATYRNELYKEDLALGAEVSGPALSADFACRHLTDGLETSSWASDADLPIQLELDLGSPKTFDVIELREDL 
KIX3QRIAAFHV0VEVDGVWQEFGSGHTVGYKRLLRGAWEA0KIRWITESQALPLLTKISLYKTPGLSKK^ 
AFAEKSIAVAKGENAYFTVKRRECSGPLEAKISIQPGTGVHGVAYQDEIQVIAFQTGETEKSLTLPTLYFAGDKTIJ^ 
FYLNLTVDGQLVDQLQVQVS Z 

10 

TD208 . 4125.12 

atgcttgaaagactgaaaagaatacattatatgttttggatcagtttaatttttatgattttccccat cct^ 
agtgactgggtggctttctgccrggcatttattgattgatattctatttgtagtggcatatttgggtct 
ctaagagccagcgcctatcritggctatattggggcctcatgctgacttatgtagttgggaatactgcctttg^ 
1 5 gttaattatatctggttt i tctt 1 1 tcctatccaatctcttaagttatcatttcagcgtacgtagtttaaagtcttt 
acatgtctggacttttcttcttgctcaagtccttgttgtggggcaactgttgatttttcagagaa 
ttctattctatctacttgtaattcttacttttgtcgatttaatgacttttggattggttcggattc 
gatttgaaagaagctcaggtcaagcaaaatgctcagataaatctattgcttgctgaaaatg;^ 

TCAGGATTTGCATGATAGTCTGGGACATACCTTTGCTATGCTGAGTGTCAAGACAGATTTAGCCTTGC^ 
20 AGATGGAGGCTTATCCACAGGTGGAAAAGGAATTAAAAGAAATTCACCAGATCAGCAAGGATCCATGA 



r^ERLKRIHYMFWISLIFMIFPILSVVTGWLSAWHLLIDILFWAYLGVLTTKSQRiSWLYWGLMLTYVV 
VNYIWFFFFLSNLLSYHFSVRSLKSLHVWTFLIJVQVLWGQLLIFORIEVEFLFYLLVILTFVDLOTrGLVRIRI 
25 DLKEAQVKQNAQINLLI^NERSRIGQDUCDSLGHTFAMLSVKTDLAI^LFOMEAYPQVXKELKEIHQISK^ 

IP209 - 4126.3 

ATGAATGATAAGTTAAAAATCTTCTTGTTGCTAGGAGTATTTTTTCTAGCCATAACCGGTTTCTATGTTCTATTGAT 
ACGAAATGCAGGGCAGACAGATGCCTCGCAAATTGAAAAGGCGGCAGTTAGCCAAGGAGGAAAAGCAGTGAAA^ 

30 - CAGAAATTAGTAAAGACGCAGACTTGCACGAAATTTATCTAGCTGGAGGTTGTTTCTGGGGAGTGGAGGAATATT^ 
TCACGTGTTCCCGGGGTGACGGATGCCGTTTCAGGCTATGCAAATGGTAGAGGAGAAACAACCAAGTACGAAT TGA T 
TAACCAAACAGGTCATGCAGAAACCGTCCATGTCACCTATGATGCCAAGCAAATTTCTCTCAAGGAAATCC^ 
ACTATTTCCGCATTATCAATCCAACCAGCAAAAATAAACAAGGAAATGATGTGGGGACCCAGTA 
TATTACACAGATGACAAGGATTTGGAAGTGATTAACGAAGTCITTGATGAGGTGGCTAAGAAATAra^ 

3 5 AGCAGTTGAAAAGGAAAACTTG AAGAATTTTGTGGTGGCTGAGGATTACC ATCAAGACTATCTCAAGAAAAATC 

ATGGCTACTGCCATATCAATGTTAATCAGGCGGCCTATCCTGTCATTGATGCCAGCAAATATCCAAA ACCAA GTGAT 
GAGGAATTGAAAAAGACCCTGTCACCTGAGGAGTATGCAGTTACCCAGGAAAATCAAACAGAACGAGCTTTCT 
CCGTTACTGGGATAAATTTGAATCCGGTATCTATGTGGATATAGCAACTGGGGAACCTCTCTTTTCATCAAAAGACA 
AATTTGAGTCTGGTTGTGGCTGGCCTAGTTTTACCCAACCCATCAGTCCAGATGTTGTCACCTACAAGGAAGATAAG 

40 TCCTACAATATGACGCGTATGGAAGTGCGGAGCCGAGTAGGAGATTCTCACCTTGGGCATGTCTTTACGGATGGTCC 
ACAGGACAAGGGCGGCTTACGTTACTGTATCAATAGCCTCTCTATCCGCTTTATTCCCAAAGACCAAATGGAAGAAA 
AAGgcTACGCTTATTTACTAGATTATGTTGATTAA 

45 MNDKLKIFLLLGVFFIJ^ITGFYVLLIRNAGOTDASQIEKAAVSQGGKAVKKTEISKDADIJiEIYI^ 

SRVPGVTDAVS G YANGRG ETTKYE L I NQTGHA£TVH\n' YDAKQ I S LKE I LLHYFRI INPTS KNKOG^roVGTQ Y 
YYTDDKDLEVINQVFDEVAKKYDOPIJVVEKENLKIIFWAEDYHQDYLKKNPNGYCHINVNQAAYPVIDASKY^ 
EELKICTLSPEEYAVTQENQTERAFSNRYWDKFESGirvTrlATGEPLFSSKDKFESGCGWPSFTOPISPDVVTYKED^ 
S YNMTRMEVRSRVGDSHLGHVFTDGPQDKGGLRYCINSLS IRFI PKDQMEEKGYAYLLDYVDZ 

50 

ID210 - 4127.1 

ATGAAAAAGAAATGGATGTATTATGCTGCTTGTTCTTCTAATGAATCrrGCCGATGACAGTTCATCTGATAAAGGAGA 

CGGCGGTTCGCTAGTCGTTTArrCACCAAACTCAGAGGGCTTAATTGGAGCAACTATTCCTGCCTTTGAAGAAA 

ATGGTATCAAAGTAGAACTGATTCAAGCTGGTACTGGAGAACTTTTCAAAAAACTAGAGTCAGAAAAAGAAC^ 

55 GTAGCTGATGTTATCTTTGGTGGTTCTTATACACAATATACTACCCACGGAGAACTCTTTGAAAACTA^^ 
AGAAAATGATAATGTTATCAAAGAATATCAAAACACAACTGGCTACTCTACTCCTTATACACTAGATG^ 
TAATCGTCAACCCTGATTTAACTAAAGGCATGAACATCGAAGGATATAACGATCITTTCAAACCT 
AAAATCGCAACTGCTGACCCAGCAAACTCrTCTAGCGCCTTTGCTCAATTAACAAATATGCTA^^ 
TTACAAAGATGATAAGGCTTGGTCTTATGTAAAAGAT C TTTTCACACTTATTGATGGTAAAATCGGTTCAAGT^ 

60 CTAGTGTCTATAAAGTAGTCGCTGATGGAGAAATGGCTGTTGGTCTCTCTTATGAAGATCCAGCAGTTAAACTCTTA 
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AATGACGGAGCTAACATTAAGGTAGTCTATCCAAAAGAAGGAACCGTCTTCCTACCTGCTAGTGCTGCTATCGTTAA 
AAAATCTAAAAATATGGAAAATGCCAAGAAATTTATCGATTTTATTATCrrCTCAAGAAG 
CAACCACTACTAACCGTCCTGTTCGTAAAAATGCTAAAACAAGCGAAAACATGAAACC^ 
CTCACTGAAGATTATGATTATGTCATCAAGAATAAATCAGATATCGTTAAGAAATACAACGAAGTCT 
5 CCAATCTAAACAGTAA 



MKKKWMYYAACSSNESADDSSSDKGDGGSLVVYSPNSEGLIGATIPAFEEKYGIKVELIQAGTGELFKKLESEKEVP 
VADVIFGGSYTQyTTHGELFE^nrTSKENDNVIKEYO^^TGySTPYTLDGSVLIVNPDLTKGMNIEGylro 
10 KIATADPANSSSAFAOLTWIIiOAOGGYKDDKAWSYVKDLFTLIDGKIGSSSSSVYKWADGEMAVGI^ 
NDGANI KVVYPECEGTVFLPASAAIVKXSKNMENAK^ 
LTEDYDYVI KNKSDI\nCKYNEVFTO^SK02 



1 5 / ATGAGTGAGATCAAAATTATTAAGdcCAAAAAAATCTACCACGATGTCCCTGTTATTGAGAATTTC 
/ TCCAAAAGGAAGTCTCTTT^GCfCTTCTTGGAGCTTCAGGATGTGGGAAAACGACCCT^ 
( TCAACAGTATCGAAGgTGdAGAATTTTACTTCGATGATACAAAAATCAATAATATGGAACCCAGC^^ 
V GGGATgGI^-TTCCAAAACTACGCTATTTTCCCACAT^ 
^ — ^GAAGGTTCCAAAAGAAGAATTGATTCAACAGACCAACAAGTATCTTGAACTCATGCAAAT^ 
20 GAAAGCCCGATAAACTCAGTGGTGGAOVACAACAACGTGTCACCTTGGCATGCGCCTTAGCGGTTA^ 

CTCCTCA.TGGACGAGCCACTTAGTAATCTGGAGGCCAAACTTCGCTTGGATATGCGTCAAGCCATCCG^^ 
ACACGAAGTGGGAATTACAACTGTTTATGTAACCCACGACCAAGAAGAAGCCATGGCTATTTCAGACC^^ 
TTATGAAAGATGGGGTGATCCAACAAATCGGCCGACCAAAAGAACTCTATCATAAACCAGCrrAATGAGTTTGTM 
ACCTTTATCGGACGCACAAATATTATCCCTGCCAATCTTGAAAAACGGAGCGACGGCGCTTATATCGTCT^ 
TGGCTATGCCCTTCGAATGCCAGCTCTTGATCAGGTTGAGGAGCAAGCTATTCATGTAAGCATTCGTCCCGAAGAGT 
TTATCAAAGAT GAAT CTGGAGATATTGAAGGAACTATTAGAGATAGCGTCTATCTTGGACTAAATACGGATTATT^ 
ATTGAGACAGGTTTTGCCTCAAAAATTCAAGTTAGTGAAGAATCAACrrTTGAAGAAGATCTACAAAAAG^ 
TATTCGTCTACGAATCA^TACGGAAAAATTAAACATG^^^ 

ACCATGGAACGTAjC'" ^ . 



25 



30 



/ MSEIKIINAKKIYHDVPVIENUJITIPKGSLFTLLGASGCGKTTLLRMIAGFNSIEGGEFYFDDTKItnmEPSKI^ \ 

t gmvfqnyaifphltvrdnvafglmokkvpkeeliqqtnkylelmoiaoyadrkpdklsggqqqrvtijvcai^ ) 

I LUTOEPLS^nLEAKLRLDMROAIR£IOHEVGITTVYVTHDOEEAMAISDQIAVMKDGVIQQIGRPKELYHK^ / 
^Xr^^^™^^P^^^^S^^AYIVFSDGYAl.RMPAI.DQ\^EOAIHVSrRPEEFIKDESGDrEGTIRDSVYLGU^ 
35 lETGFASKIQVSEESTFEEDLQKGNRIRLRItrrOKLNIFSADGSQNLIKGVNHGTZ 



40 



ID212 - 4136.1 ' ~ ^ 

- ATGAAGAAAAAATTATTGGCAGGTGCCATCACACTATTATCAGTAGCAACTTTAGCAGCTTGTTCGAAAGGGTCAGA 
AGGTGCAGACCTTATCAGCATGAAAGGGGATGTCATTACAGAACATCAATTTTATGAGCAAGTGAAAAGCAACCCTT 
CAGCCCAACAAGTCTTGTTAAATATGACCATCCAAAAAGTTTTTGAAAAACAATATGGCTCAGAG 
GAGGTTGATGATACTATTGCCGAAGAAAAAAAACAATATGGCGAAAACTACCAACGTGTCTTGTCAC^ 
GACTCIT'GAAACACGTAAAGCrCAAATTCGTACAAGTAAATTAGTTGAGTTGGCA^ 

AATTGACAGATGAAGCCTATAAGAAAGCCTTTGATGAGTACACrCCAGATGTAACGGCTCAAATCATCCG^ 
AATGAAGATAAGGCCAAAGAAGTTCTCGAAAAAGCCAAGGCAGAAGGTGCTGATTTTGCTCAATTAGC 

45 TTCAACTGATGA AAAA ACAAAAGAAAATGGTGGAGAAATTACCTTTGATTCTGCTTCAAC^ 

TCAAAAAAGCCGCTTTCGCTTTAGATGTGGATGGTGTTTCTGATGTGATTACAGCAACTGGC^ 
AGCCAATATTACATTGTAAAACTCACTAAGAAAACAGAAAAATCATCTAATATTGATGACTACAAAGAAAAAT^ 
AACTGTTATCTTGACTCAAAAACAAAATGATTCAACATTTGTTCAAAGCATTATCGGAAAAGAAT^^ 
ATATCAAGGTTAAGGACCAAGCCTTCCAAAATATCTTTACCCAATATATCGGTGGTGGAGATTCAAGCTCAAGCAGT 

3U AGTACATCAAACGAATAG 

MKKKLLAGAITLLSVATIJW^CSKGSEGADLISMKGDVITEHOFYEOVKS^rPSAQQVLL^WTIQ^ 

EVDDTIAEEKKQYGENYORVLSQAGMTLETRKAQIRTSKLVEI^VKKVAEAELTDEAYKKAFDEYTPDVTAQIIRI^ 

NEDKAKEVLEKAKAEGADFAOLAKDNSTDEKTKENGGEITFDSASTEVPEQVKKAAFALDVDGVSDVITATGTQAY 

SQYYIVKXTKKTEKSSNIDDYKEKXKWILTQKOOTSTFVOSIIGKSLOAANIICVKDQAFONIFTQYIGGGDSSSSS 
STSMEZ 



60 



A 
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ID213 - 4137.3 

ATGAAAAAAAATATTAAACAATATGTAACCrrAGGTACTGTAGTGGTATTATCAGCATTTGTTGCT^ 
AGCTCAGGAGACTGAAACTTCTGAAGTATCAACACCAAAGTTGGTGCAACCTGTTGCACC^ 
AAGTACAACCTACATCGGATAACTCTTCGGAAGTTACTGTACAACCTCGAACAGTTGAAACT 
5 TCTTCTACAGCGGAAGAAACTCCTGTCTTAGAAAAAAATAATGTTACTTTAACAGGGGGC 

AGAGTTAAAGGATAAATTTACTAGCGGTGACi'ilACTGTAGTGATTAAGTACAATCAGTCAAGTGAGAAA 
AAGCTCTGTTTGGAATATCTAATTCCAAACCCGGTCAACAAAATAGTTATGTAGATGTGTTCCTTA 
GAGTTGGGGATGGAAGCGCGTGATACTTCTTCCAATAAAAATAACCTAGTATCCAGACCTGCTTCAGm 
GTACAAACAAGAGGCTGTGACTAACACTGTTGCAGTAGTAGCAGATTCAGTCAAAAAAACATATTCT^ 
10 ATGGTACAAAAGTAGTAGAAAAGAAAGTGGATAATTTCCTAAACATCAAGGATATTAAAGGTAT^^ 
CTTGGGGGAGTGAAACGTGCAGGAAAAACGGCGTTTGGTTTTAACGGAACACT 
TGCATTGGATGAAGAAACTGTTAAAAAGATGACAACAAACGCTGTTACTGGACATTTAATT^ 
CAACAGGTTCTAACTATTTCCGTATTCCAGTTCTGTATACTTTTAGCAATGGTCGGGT^ 

CGTTACGGTGGAACTCATGAT T I'C'n'GAATAAAATTAATATTGCT ACAAGTTATAGTGATGATAATGGTAAGACATG 
1 5 GACTAAACCAAAATTAACATTGGCATTCGATGATTTTGCGCCAGTACCATT AGAATGGCCTCGTG 

GTGACTTACAAATCAGCGGTGGTGCAACCTATATTGACTCTGTTATTGTTGAAAAAAAGAACAAACAAGTACT 
TTTGCTGATGTGATGCCTGCTGGAGTAAGTTTTAGAGAAGCAACTAGAAAAGATTCAGGTTATAAACAA^ 
TAATTATTACCTTAAATTAAGGAAAOL/VGGTGATACTGATTACAATTATACTATTCGTGAGAATGGl^ 
ACGATCGTACCAACAGACCAACTGAATTTTCAGTAGATAAAAATTTCGGTATTAAACAAAATGGTAATTATTTGACG 
20 GTAGAGCGG 

MKKNIKQYVTLGTWVLSAFVANSVAAQETETSEVSTPKLVQPVAPTTPISEVQPTSDNSSEVTVOPRTVETTVKDP 
SSTAEETPVLEKNNVTLTGGGE^A^^KELIaDKFTSGDFTWIKYNOSSEKGLQALFGIS^ISKPGQONS^^ 
£I^MEAfU3TSSNKNNLVSRPASWGKYKQEAVTNTVAWADSVKKTYSLYANGTKWEKK^ 
25 LGGVKJ^GKTAFGFNGTLENIKFFNSAU^EETVKKMTTNAVTGHLIYTANDTTGSNYFRIPVLYTFSNGRVFSSIDA 
RYGGTHDFLNKINIATSYSDDNGKTWTKPKLTLAFDDFAPVPLEWPREVGGRDLQISGGATYIDSVIVEKKNKQVLM 
FADVM P AGVS FREATRKDSG YKQ I DGNYYLKLRKQGDTD Y^^YT I RENGTVYDDRTNRPTE FS VDKNFG I KQNG 
VER 

30 ID214 - 4185 

ATGAAAAAArrrAGCCTATTACTAGCrrATCCTACCATTTTTGGTTGCCTGTGAGAATC;^ 

TAGCGCTCAAAAGACAATCGTCCTTGCTACAGCTGGCGACGTGCCACCATTTGACTACGAAGACAAGGGCAATCTGA 

CAGGCTTTGATATCGAAGTTTTAAAGGCAGTAGATGAAAAACrrCAGCGACTACGAGATTCAATTCCAA^ 

TGGGAGAGCATCTTCCCAGGACTTGATTCTGGTCACTATCAGGCTGCGGCCAATAACTTGAGTTACACAAAAGAGCG 

35 TGCTGAAAAATACCTTTACTCGCTTCCAATTTCCAACAATCCCCTCGTCCTTGTCAGCAACA^ 

CTTCTCTTGACCAGATCGCTGGTAAAACAACACAAGAGGATACCGGAACTTCTAACGCTCAATTCATCAATA^ 
AATCAGAAACACACTGATAATCCCGCTACAATTAATTTTTCTGGTGAGGATATTGGTAAACGAATCCTAGACCTTGC 
TAACGGAGAGTTTGATTTCCTAGTTTTTGACAAGGTATCCGTTCAAAAGATTATCAAGGACCGTC^ 
CAGTCGTTGATTTACCTTCTGCAGATAGCCCCAGCAATTATATCATTTTCTCAAGCGACCAAAAAGAGTTTAAAGAG 

40 CAATTTGATAAAGCGCTCAAAGAACTCTATCAAGACGGAACCCTTGAAAAACTCAGCAATACCTATCTAGGTGGTTC 
TTACCTCCCAGATCAATCTCAGTTACAATAA 



MKKFSLLIAILPFLVACENQATPKETSAQKTIVLATAGDVPPFDYEDKGNLTGFDIEVLKAVDEKLSDYEIQFQRTA 
45 WESIFPGIJ^SGHYQAAANNLSYTKXRAEKYLYSLPIS^rNPLVLVSNKKNPLTSLDOIAGKTTQEDTGTSNAOFI^^ 
NOKHTDNPATINFSGEDIGKRIIX)IJ^GEFDFLVFDKVSVOKIIKDRGIJDLSVVDLPSADSPSNYIIFSSDQKEFKE 
QFDKALKELYODGTLEKLSNTYLGGSYLPD0SQLQ2 

ID215 - 4211.1 

50 atgaaaaaaaatagtttatatatcatatcctcactcttttttgcttgtgtcttatttgtctatgctactc 

ttttcaaaacagtaccagtgctaggcaggtaaaaacggaaacctatactaatacagtaacaaatgtccctatt^ 
tacgctataatagtgataagtattttattagcggttttgcttcagaagtatcagtggtcttgactggtgcaaatcgc 
ctatcgctagctagtgaaatgcaagaaagtacacgtaaattcaaggttactgctgacctaacagatgccggtgttgg 
aacgattgaagttcctttgagcattgaagatttacccaatgggctgaccgctgtggcgactccgcaaaa;^ 

55 tcaagattggtaagaaggctcagaaggataaggtaaagattgtaccagagattgaccctagtcaaattgatagtcgg 
gtacaaattgaaaatgtcatggtgtcagataaagaagtgtctattacgagtgaccaagagacattggatagaattga 
taagattatcgctgttttgccaactagasaacgtataacaggtaattacagtggttcagtacctttgcaggcaatc 
accgcaatggtgttgtcttaccggcagttatcactccgtttgatacaataatgaaggtgactacaaaaccagtagca 
ccaagttcaagcacatcaaattcaagtacaagcagttcatcggagacatcttcgtcaacgaaagcaactagttcaaa 

60 aacgaattaa 
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MK3CNSLYI ISSLFFACVLFVYATATTJFQNSTS ARQVKTETYTNTVTWP ID I R 
I^IJ^EMQESTRKFKVTADLTDAGVGTIEVPLSIEDLPNGLTAVATPQKITVKIGKKAOmW 
V0IENVMVSDKEVSITSD0ETIJ3RIDKIIAVLPTSERITG^^X•SGSVPLQAIDRNGVVLPAVITPFDTIMKV1^ 
5 PSSSTSNSSTSSSSETSSSTKATSSKTN2 

rD216 - 4127.3 

ATGTTGATTGGCGAAGGGTATCGGACTTTCCCTGTCCTGATTTATACCCAATTTATTAGCGAGGT^^ 
TGCITTTGCAATTATGGCGArrATCATTGCCTrGGCAATTTTCCTTATCC^^ 
1 0 TCAGCATGAATCTGCTCCATCCAATTGAGCCTAAAAAAACTACAAAAGGAAAAATGGCTGCC^ 
TACGGAATTATCTTTATCTCTGTTTTACCTCAAATCTACTTAATTTATACCT 

ATCTGTTAAAGGTTATTCTCCAAACAGTTACAAGGTAG Ci i i CCATCGTATGGGATCTGCTATnTCAATACCATTC 
GTATCCCTTTGATTGCCTTAGTTCTAGTTGTTCTATTTGCGACATTTATCTCCT 
TTGTTTACAAACTTAATTGACAGCCTCAGTATGGTACCTTATATTGTACCAGGAACCGTTCT 
15 TTCTTCCTTCAATACTGGTCTATTTGGAAGTGGATTTCTTATGATTACAG^ 

TATCnSCCAGAAGATTACCTTATACTATTCGCTCATCTGTTGCTAGCTTACAAC;^ 

GCTGCTGAAAGCTTAGGAAGTAGTCGTCTCAATACCTTTGCTAAGATTACAACTCCAATGATC 

TTCTGGAGCCATCTTATCTTGA 

20 

MLIGEGYRTFPVT.iyTOFISEVGGNSAFAIMAIIIAIJ^IFLI0KHIANRYSFSMNLLHPIEPKKTTKGKM^ 
YGIIFISVLPOIYLIYTSFLKTSGMVSVKGYSPNSYKVAFHRMGSAIFNTIRIPLIALVLVVLFATFISyLAVRKRN 
LFTNLIDSLSMVPYIVPGTVLGIAFISSFNTGLFGSGFLMITGTAFILIMSLSARRLPYTIRSSVASLQQIAPSIEE 
AAESLGSSRLNTFAKITTPMMLSGI ISGAILS2 

25 
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Table 4 

^\^P301 

ATGAATAAGAAAAAAATGATTTTAACAAGTCTAGCCAGCT 
TAOTTTTGTAAGAGCAGAAGAATCTCCACAAGTTGTCGAAAAATCrrC^ 

^GCTGATACTGCCAAGAAAGATTACGAAACGGCTAAAAAGAAAGCAGAAGACGCTCAGAAAAAGTAT^ 
GATCAG 

tcaaaaIx^tataaagagtaccgagaagttcaaaatcaacgtagtaaatataaatctgacgctgaatatcaga;^ 
aattaacai^ggtcgacrctaaaatagagaaggctaggaaagagcaacaggacttgcaaaat^^ 

AGAGCAGTTGTAGTTCCTGAACCAAATGCGTTGGCTGAGACTAAGAAAAAAGCAG^ 

agtagctaagac^aaaatatgattatgcaactctaaaggtagcactagcg;^ 

aaattgaaaaacttcaatatgaaatttctactttggaac^ 

aaaaaacttcttgc^ggtgcggatcctgatgatggcacagaagttatagaagctaaatta 

gctaaacgctaaacaagctgagttagcaaaaaaacaaacagaacttgaaaaacttc^ 

gtaagactcaggatgaafrragataaagaagcagaagaagctgagttggataaaaaagctc^ 

gttgctgatttagaaaaa^aaattagtaaccttgaaatattacntggaggggcrg 

tcttcaaaataaattagctgctaaaaaagcrgagttagc^^ 

ttgatcctgaaggtaagactc^tcatgaattagataaagaagcagaagaagctgagttggataaaaa^ 

CTTCAAAATAAAGTTGCTGATrt;AGAAAAAGAAArrAGTAACCT^ 

tgatactgctgctcttc:aaaata)*attagctactaaaaaagctgaattggaaaa 

ctcttaatgagttaggccctgatgg^^atgaagaagaaactccagcgccggctcctcaacca 

gcaccaaaaccagagcaaccagctcc^ctccaaaaccagagcaaccagcrcct 

tccagctccaaaaccagagcaaccagct^cagctccaaaaccagagcaaccagctaagccggaga^ 

agcctactcaaccagaaaaaccagccactccaaaaacaggctggaaacaa 

ACTGATGGTTCAATGGCAATAGGTTGGCTCGT^AAACAACGGTTCATGGTACTACCT 

AACAGGTTGGGTGAAAGATGGAGATACCTGGT^CTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAA^ 

AAGTATCAGATAAATGGTACTATGTCAACAGCmTGGCGCTATGGCGACAGGCTGGCTCCAATAC^ 

TACTACCTCWVCGCTAATGGTGATATGGCGACAGC^TGGCTCCAATACAACGGTTC^ 

TGGTGATATGGCGACAGGATGGGCTAAAGTCAACGhTTCATGGTACTACCTAAACGCTAACGGTGCTATGG 

gttgggctaaagtcaacggttcatggtactacctaaAcgctaacggttcaatggcaacagg^ 

gatacctggtactatcttgaagcatcaggtgctatgaww3caagccaat 

tgtcaatggcttaggtgcccttgcagtcaacacaacrgtagatggctataaagtcaat^ 

AA 



MNKKKMILTSIJ^VAILGAGFVTSOPTFVW^ESPOVVEKSSLBKKYEEAKAKADTAKro 

DQKRTSEKARKEAEASOKI^VALWQNAYKEyREVQNORSKYKMAEYQKKLTBVD 

RAVWPEPNAIJ^TKKKAEEAKAEEKVAKRKYDYATLKVAIJUCKSV^ 

KKLlAGADPDDGTEVIEAKLKKGEAEIJ^AKQAErjUCKQTELEKLLDSDDPEG 

VADLEKE I SNLE I LLGGAD PEDDTAALQNKLAAKKAJEIJ^KKQTELEKiSe^IuD PEGKTQDEL^ 

LONKVADLEKE I SNLE I LLGGADSEDDTAALONKIJ^TKKAELEKTOKELdXaI^EL^ 

apkpeopapapkpeopapapkpeopapapkpeqpapapkpeqpakpekpaeeptqpekpatpfctgwkqengmwyfyn 
tdgsmaigwlo^mgswyylnangamatgwkdgdtwyyleasgamkasowfkvbdicwyyw 
yyl^angd^^tgwlqyngsmyylnangd^^tgwakvngswyylnangamatgwaib\^gsmyy^ 

DTVm'LEASGAMKASQWFK^rsDKWYYWGUSAIAVNTTVDGYK^ 
IP30 2 

ATGTTTGCATCAAAAAGCGAAAGAAAAGTACATTATTCAATTCGTAAATTTAGTGTTGGA^ 

TGCCAGTCTTGTTATGGGAAGTGTGGTTCATGCGACAGAGAACGAGGGAGCTACCCAAGTACCCAC^ 

GGGCAAATGAAAGTCAGGCAGAACAAGGAGAACAACCTAAAAAACTCGATTCAGAACGAGATA^^ 

GTCGAGGAATATGTAAAAAAAATAGTGGGTGAGAGCTATGCAAAATCAACTAAAAAGCGACATAG^ 

TCTAGTTAACGAGTTGAACAACATTAAGAACGAGTATTTGAATAAAATAGTTGAATCAACCT^ 

AGATACTGATGATGGAGAGTCGATCAAAAGTAGATGAAGCTGTGTCTAAGTTTGAAAAGGACrCATCna^ 

AGTTCAGACTCTTCCACTAAACCGGAAGCTTCAGATACAGCGAAGCCAAACAAGCCGACAGAACCAGG^ 

agcagaagctaagaagaaggttgaagaagctgagaaaaaagccaaggatcaaaaagaagaagatcgtcc^^ 
caaccattacttacaaaacgcttgaacttgaaattgctgagtccgatgtggaagttaaa;^ 

GTAAAAGTGAAAGCTAACGAACCTCGAGACGAGCAAAAAATTAAGCAAGCAGAAGCGGAAGTTGAGAGTAAAC^ 

tgaggctacaaggttaaaaaaaatcaagacagatcgtgaagaagcagaagaagaagctaaacgaagagcagatgc 

AAGAGCAAGGTAAACCAAAGGGGCGGGCAAAACGAGGAGTTCCTGGAGAGCTAGCAACACCTGATAAAAAAGAA^ 

gatgcgaagtcttcagattctagcgtaggtgaagaaactcttccaagcccatccctgaaaccagaaaaaaagct 

AGAAGCTGAGAAGAAGGTTGAAGAAGCTAAGAAAAAAGCCGAGGATCAAAAAGAAGAAGATCGCCGTAACTACCCAA 
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15 



20 



25 2 



30 



CCAATACTTACAAAACGCITGAACTTGAAATTGCrGAGTCCGATGTGGAAGTTAAAAAAGCGGAGCTTGAACTAGTA 
AAAGAGGAAGCTAAGGAACCrCGAAACGAGGAAAAAGTTAAGCAAGOUUVAGCGGAAGTTGAGAGTAAAAAAGCTGA 
GGCTACAAGGTTAGAAAAAATCAAGACAGATCGTAAAAAAGCAGAAGAAGAAGCTAAACGAAAAGCAGCAGAAGAAG 
ATAAAGTTAAAGAAAAACCAGCTGAACAACCACyuVCCAGCaCCGGCTCCAAAAGCAGAAAAACCAGCTCaVGCTCCA 
AAACCAGAGAATCCJVGCTGAACAACCAAAAGCAGAAAAACCAGCTGATCAACAAGCTGAAGAAGACTATGCTCGTAG 
ATCAGAAGAAGAATATAATCTCTTGACTCy^Cy^GCAACCGCCAAAAACTGAAAAACCAGCACAACCATCTACTCCAA 
^5^S5SI22t:^^°*^='^'^ATGTGGTACTTCTACAATACT^^ 

AACAATGGCTCATGGTACTACCTCAACAGCAATGGCGCTATGGCGACAGGATGGCTCCAAAACAATGGTTCATGGTA 
CTATCTAAACGCTAATGGTTCAATGGCAACAGGATGGCTCCAAAACAATGGTTCATGGTACTACCTAAACGCTAATG 

TGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGGTGATATGGCGACAGGTTGGGTGAAAGATGGAGA 
TACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTATG 
TCAATCCCTCAGGTGCCCTTGCAGTCAACACAACTGTAGATGGCTATGGAGTCAATGCCAATGGTGAA 

MFASKSERKVHYSIRKFSVGVASVWASLVMGSVVHATENEGATQVPTSSNRANESOAEOGEOPKKLDSERDKARKE 
VEEWKKIVGESYAKSTKKRHTITVALVNELNNIKNEVLNKIVESTSESQLQILMMESRSKVDEAVSKFEKDSSSSS 

n^^fc!^52!?.^ °^^^^'^°^'^'^^^'^'^^^^^^^KRRADAKE^ 

DAKSSpSSVG^.ETLPSPSLKPEKKVAEAEKKVEEAKKKAEDOKEEDRRNYPT^^rYKTI£LErAESDVEVKKAELELV 
v^,^o!^?f.^^ ^°°"^^°^*^^^^^'^^^^0PP'^2'^A0PSTPKTGWKQENGMWYFYNTDGS^^^ 

YGVNANGEWVN 



WLQYNGSWYYUJANGDMATGWKDGDTWYYLEASGAMKASQWFKVSDKWyYVNGSGAIAVOTTVDG 



IP3 03 

^I?S^^f^f^°^°If^^^°°*°^A'^°'«»=AACCTG^ 

°™I5f^°^*^'=^°"^°^^"^CCAAGGAACAGATGTAGCAGTAGAACCAGCTAAAGC^^ 

AGTTCATGGTACTACCTCAACAGCAACGGTTCTATGAAAGTOUITCAATGGTTCCAAGTTGGTGGTAA^ 

TCTAAATACATCGGGTGAGTTAGCGGTCAATACAAGTATAGATGGCTATAGAGTCAATGATAATGGTGAATGGGT^ 

GTTAA 



35 MVKRRI RRGTREPEKVWPEOSS I PSYPVSVTSNOGTDVAVEPAKAVAPTTDWKQENGMWYFYNTDGSMATGWVQVN 

SSWYYLNSNGSMKVNOWFOVGGKWYYVNTSGELAVNTSIDGYRVNDNGEWVRZ " iNiUOi.raAi(,wvavN 

ID3 04 

^3 ^^^^^^'^ATAATGTAGATGGAAAGAGTGaACAAGTAACTGAGAGTACTTTAAGTCAAAAAAAAGACTCTCAAAC 

^^?^""^***^°**=caagccccaagttcttctccaagaaattE^ 

CAACTCTTGATAAAAGTCAAGAAGTAGAAGAAGTAGGAGAAATTGGTAAATTACTCTTACTACAATC^^ 
SO ^ff^'^!^^^°^^°^'^'^^°"^°^'^°P^ I VKVATKPKWTKKl SSTRIRYEKDETKDRSENPVTIDGED 

TITYNVDGKSGQVTESTLSQKKDSQTRVVKKRTKPQVLVQEIPIETEYLDGPTLDKSQEVEEVGEIGKLLLLQSIL2 

iE305 

OU ^<=^^=AT<==°=AAGAATATGATAGCCAAGCAGAGAAACGAGTCTATTATTTTGAAGATCAGCGTAGTTATCATACT 
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TTAAAAACTGGTTGGATTTATGAAGAGGGTCATTGGTATTATTTACAGAAGGATGGTGGCTTTGAT^ 

CAGATTGACGGTTGGAGAGCTAGCACGTGGTTGGGTTAAGGATTACCCTCTTACGTATGATGAAGAGAAGCTAAAAG 

CAGCTCCATGGTACTATCTAAATCCAGCAACTGGCATTATGCAAACAGGTTGGCAATATCT 

TACCTCCATTCGTCAGGAGCTATGGCAACTGGCTGGTATAAGGAAGGCTCAACTTGGTACTATCTAGATGCTGAA^ 
5 TGGTGATATGAGAACTGGCTGGCAAAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCT^^ 

• GTTGGTATCAGGAAAGTTCGACTTGGTACTATCTAAATGCAAGTAATGGAGATATGAAAACAGGCTGGTTCCA^ 
AATGGTAACTGGTACTATGCCTATGATTCAGGTGCTTTAGCTGTTAATACCACAGTAGGTGGTTACTACTTA^ 
TAATGGTGAATGGGTTAAGTAA 

10 MKLLKK>IMQIAIATFFFGLI^TNTVFADDSEGWQFVQENGRTYYKKGDLKETYWRVI 

rPAPHKGVTIGPSPRrEIALRPDWFYFGQDGVU3EFVGKQVLEAj<TAmi^KHHGEEYDSQAEKRVYYF 
LKTGWIYEEGHWYYLQKDGGFDSRINRLWGELJ^GWVKDYPLTYDEEKLKAAPWYYI^PATGIMQTCT 
YI*HSSGA^L^TGWYKEGSTWYYLDAENGDMRTGWQ^rU3NK>^^YLRSSGA^lATGWYQESSTWYYLJ^ 
NGNWYYAYDSGALAVNTTVGGYYLNYNGEWVKZ 

15 

ID3 0 6 

TTGGCTGGTAGATATGGTTCTGCTGTTCAGTGTACAGAAGTGACTGCCTCAAACCTTTCAACAGTTAAAA 
TACGGTTGTAGAAAAACCACTGAAAGATTTTAGAGCGTCTACGTCTdATCAGTCTGGTTGGGTGGA^ 
AATGGTATTTCTATGAGTCTGGTGATGTGAAGACAGGTTGGGTGAAAACAGATGGTAAATGGTACTATTTGAATGAC 
20 TTAGGTGTCATGCAGACTGGATTTGTAAAATTTTCTGGTAGCTGGTATTACTTGAGCAATTCAGGTGCTATG 

AGGCTGGGGAACAGATGGTAGCAGATGGTTCTACTTTGACGGCTCAGGAGCTATGAAGACAGGCTGGTACAAGGAAA 
ATGGCACTTGGTATTACCTTGACGAAGCAGGTATCATGAAGACAGGTTGGTTTAAAGTCGGACCACACTGGTACTAT 
GCCTACGGTTCAGGAGCTTTGGCTGTGAGCACAACAACACCAGATGGTTACCGTGTAAATGGTAATGGTGAATGGGT 
AAACTAG 

25 

LAGRYGSAVOCTEVTASNLSTVKTKATVVEKPLKDFRASTSDQSGWESNGK>rfFYESGDVCT 

LGVMQTGFVKFSGSWYYLSNSGAMFTGWGTDGSRWFYFDGSGAMKTGWYKENGTWYYLDEAGIMKTGWFKVGPHWYY 
AYGSGALAVSTTTPDGYRVNGNGEWVN2 

30 ID307 

ATGAAAATTTTGAAAAAAACTATGCAAGTTGGACTGACAGTATTTTTCTT'rGGTTTGCTAGGGACCAGTACAGTATT 
TGCAGATGATTCTGAAGGATGGCAGTTTGTCCAAGAAAACGGAAGAACCTACTACAAAAAGGGGGACCTCAAAGA^ 
CCTACTGGCGAGTGATTGATGGTAAGTACTATTA TT TTGATTCTCTATCTGGAGAGATGGTTGTCGGCTGGCAATAT 
ATCCCGTTTCCA.TCTAAAGGTAGTACAATTGGTCCTTACCCAAATGGTATCAGATTAGAAGGTTT T CCAAAGTCAGA 

3'5 GTGGTACTACTTCGATAAAAATGGAGTGCTACAAGAGTTTGTTGGTTGGAAAACATTAGAGATTAAAACTAAAGACA 
GTGTTGGAAGAAAGTACGGGGAAAAACGTGAAGATTCAGAAGATAAAGAAGAGAAGCGTTATTATACGAACTATTAC 
TTTAATCAAAATCATTCTTTAGAGACAGGTTGGCTTTATGATCAGTCTAACTGGTATTATCTAGCT 
TAATGGAGAAAACTACCTTGGTGGTGAAAGACGTGCGGGGTGGATAAACGATGArrCGACTTGGTACTACCTAGATC 
CAACAACTGGTATTATGCAAACAGGTTGGCAATATCTAGGTAATAAGTGGTACTACCTCCGrrCCTCAGGAGCAATC 

40 GCCACTGGCTGGTATCAGGAAGGTACCACTTGGTATTATTTAGACCACCCAAATGGCGATATGAAAACAGGTTGGCA 
AAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTGGTTGGTATCAAGATOT 
GGTACTACCTAAATGCAGGTAATGGAGACATGAAGACAGGTTGGTTCCAGGTCAATGGCAACTGGTACTATGCTTAT 
AGCTCAGGTGCTTTGGCAGTGAATACGACCGTAGATGGCTATTCTGTCAACTATAATGGCGAATGGGTTCGGT^ 

45 MKILKKTMQVGLTVFFFGLLGTSTVFADDSEGWQFVQENGRTYYKKGDLKETYWRVIDGKYYYFDSLSGEMVVGWQY 
IPFPSKGSTIGPYPNGIRLEGFPKSEWYYFDKNGVLQEFVGWKTLEIIO'KDSVGRKYGEKREDSEDKEEKRYYTNYY 
FNQ^mSLETGWLYDQS^^4YYtAKTEINGENYIXSGERRAGWI^roDSTWYYLDPTTGIMQTGWQY^^ 
ATGWYQEGTTWYYtX>HPNGDMKTGWQNLGNKWYYLRSSGA^L^TGWYQDGSTWYYl^AGNGDMKTGWFQW 
S S GAUSlVNTTVDGYS VN YNGEWVRZ 

50 

ID3 0 8 

ATGAAAAAGAAATTAACTAGTTTAGCACTTGTAGGCGCTTTTTTAGGTTTGTCATGGTATGGGAATGT^ 
AGAAAGTTCAGGAAATAAAATCCACrTTATCAATGTTCPAGAAG^ 

55 GACATTTTGCCATGGTGGATACAGGAGAAGATTATGATTTCCCAGATGGAAGTGATTCTCGCTATCCATGGAGAGAA 
GGAATTGAAACGTCTTATAAGCATGTTCTAACAGACCGTGTCTTTCGTCGTTTGAAGGAATTGGGTGTCCAAA^ 
TGATTTTATTTTGGTGACCCATACCCACAGTGATCATATTGGAAATGTTGATGAATTACTGTCTACCTATCCAGTTG 
ACCGAGTCTATCTTAAGAAATATAGTGAtAGTCGTATTACTAATTCTGAACGTCTATGGGATAATCTGTATGGCTAT 
GATAAGGTTTTACAGACTGCTGCAGAAAAAGGTGTTTCAGTTATTCAAAATATCACACA^ 

60 GTTTGGGGACATGGATATTCAGCTCTATAATTATGAAAATGAAACTGATTCATCGGGTGAATTAAAGAAAATCT 
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atgacaattccaattcctit^ttagcgtggtgaaagtcaatggcaagaaaatttacc^ 
gttcatggagcagaagacaagtatggtcctctcattggaaaagttgatttgatgaagttt;^ 
caacaaatcaaataccaaggatttcattaaaaatttgagtccgagtttgattgttcaaa 
ggaaaaatggtgttgatagtgagtatgttaattggctcaaagaacgaggaattgagagaatcaacgcagccagca 

5 GACTATGATGCAACAGT rrT 'X'GATATTCGAAAAGACG G T ' r ' T ' l 'GTCAATATTTCAACATCCTACAAGCCGATTCCAAG 

TTTTCAAGCTGGrrGGCATAAGAGTGCATATGGGAACTGGTGGTATCAAGCGCCTGATTCTACAGGAGA^ - 
TCGGTTGGAATGAAATCGAAGGTGAATGGTATTACTTTAACCAAACGGGTATCrrTGTTACAGAATCAATGGA 
TGGAACAATCATTGGTTCTATTTGACAGACTCTGGTGCTTCTGCTAAAAATTGGAAGAAAATCGCT^ 
TTATTTTAACAAAGAAAACCAGATGGAAATTGGTTGGATTCAAGATAAAGAGCAGTGGTATTAT^ 
1 0 GTTCTATGAAGACAGGATGGCTTCAATATATGGGGCAATGGTATTACTTTGCTCCATCAGGGGAAATGAA^ 

TGGGTAAAAGATAAAGAAACCTGGTACTATATGGATTCTACTGGTGTCATGAAGACAGGTGAGATAGAAGTTGCT^ 
TCAACATTATTATCTGGAAGATTCAGGAGCTATGAAGCAAGGCTGGCATAAAAAGGCAAATGAT^^ 
AGACAGACGGTTCACGAGCTGTGGGTTGGATCAAGGACAAGGATAAATGGTACTTCTTGAAAGAAAATGGTC^ 
CTTGTGAACGGTAAGACACCAGAAGGTTATACTGTGGATTCAAGTGGTGCCTGGTTAGTGGATGTTTCGATCGAGAA 

15 

ATCTGCTACAATTAAAACTACAAGTCATTCAGAAATAAAAGAATCCAAAGAAGTAGTGAAAAAGGATCT^ 
AAGAAACGAGTCAACATGAAAGTGTTACAAATTTTTCAACTAGTCAAGATTTGACATCCTCAACT 

gaaacgagtgtaaacaaatcggaatcagaacagtag 

mkkkltsija*vgaflglswyg^rvoaoessgnkihfi^^vqeggsdaiilesnghfa^wdtgedydfpdgsdsrypv^ 
20 gietsy^chv^tdrvfrrlkelgv0k1.dfilvththsdhig^^vdellstypvdrvy 

dkvlotaaekgvsvionitogdahfqfgdmdiolynyenetdssgelkkiwddnsnsliswkvngkkiylggdldn 
vhgaedkygpligkvdlwkfnhhhdtnksotkdfiknlspslivqtsdslpwkkgvdseyvnwl 
dydatvfpirkdgfvnistsykpipsfqagwhksaygnwwyoapdstgeyavgwneiegewyyfnqtgillqnqwkk 
wnnhwfyItdsgasaknwkkiagiwyyfnkenomeigwiqdkeqwyyldvdgsmktgwlo™ 

25 

wkdketwyymdstgvmktgeievagqhyyledsgamkogwhkkandv^fyktdgsravgwikdkd 

LVNGKT P EGYTVDS SGAWLVDVS I EKS ATI KTTSHS E I KES KEWKKD LENKETSQKES VTNFSTSQ 
ETSVNKSESEQZ 

ID309 

30 ATGGAAATTAATGTGAGTAAATTAAGAACAGATTTGCCTCAAGTCGGCGTGCAACCATATAGGCAAGTACACGCACA 
CTCAACTGGGAATCCGCATTCAACCGTACAGAATGAAGCGGATTATCACTGGCGGAAAGACCCAGAATTAG^ 
TCTCGCACATTGTTGGGAACGGTTGCATCATGCAGGTAGGACCTGTTGATAATGGTGCCTGGGACGTTGGGGGCGGT 
TGGAATGCTGAGACCTATGCAGCGGTTGAACTGATTGAAAGCCATTCAACCAAAGAAGAGTTCATGACGGACTACCG 
CCTTTATATCGAACTCTTACGCAATCTAGCAGATGAAGCAGGTTTGCCGAAAACGCTTGATACAGGGAGTTTAGCTG 

35 GAATTAAAACGCACGAGTATTGCACGAATAACCAACCAAACAACCACTCAGACCACGTTGACCCTTATCCATATC^ 
GCTAAATGGGGCATTAGCCGTGAGCAGTTTAAGCATGATATTGAGAACGGCTTGACGATTGAAACAGGCTGGCAGAA 
GAATGAC ACTG GCTACTGGTACGTACATTCAGACGGCTCTTATCCAAAAGACAAGTTTGAGAAAATCA^ 
GGTACTACTTTGACAGTTCAGGCTATATGCTTGCAGACCGCTGGAGGAAGCACACAGACGGCAACTGGTACTGGTTC 
GACAACTCAGGCGAAATGGCTACAGGCTGGAAGAAAATCGCTGATAAGTGGTACTATTTCAACGAAGAAGGTGCCAT 

40 GAAGACAGGCTGGGTCAAGTACAAGGACACTTGGTACTACTTAGACGCTAAAGAAGGCGCCATGGTATCAAATGCCT 
TTATCCAGTCAGCGGACGGAACAGGCTGGTACTACCTCAAACCAGACGGAACACTGGCAGACAAGCCAGAATTCACA 
GTAGAGCCAGATGGCTTGATTACAGTAAAATAA 

MEI^^VSKIJiTDLPQVGVOPYROVHAHSTGNPHSTVQNEADYHWRKDPELGFFSHIVGNGCIMOVGPVDNGAWDVGGG 
45 WNAETYAAVELIESHSTKEEFMTDYRLYIELLRNlJUDEAGLPKTLDTGSIJVGIKTHEYCrNNOPNr^ 
AKWGISREQFKHDIENGLTIETGWQKNDTGYVrrVHSDGSYPKDKFEKINGTWYYFDSSGYMI^ 

DNSGEMATGWKKIADKWYYFNEEGAMICTGWVKYKDTWYYLDAKEGA>fVSNAFIQSADGTGWYyLKPIX31^^ 
VEPDGLITVK2 

50 rD3io 

atg ggc acaacaggatttac:aataattgacttap.ttatcttgattgtttattt 

ctattt ctcta aaaaagagatgaaaggaaaagagttctttaaaggagatggttcggttccttggtatgttact 

TATCCATTTTTGCCACAATGCTCAGTCCGATTTCCTTCTTGGGACTCGCTGGTAGCrCTTATGCAG 
TTATGGTTTGCTCAATTAGGGATGGTAGTAGCTATTCCACTGACAATTCGTTTTATCTTACCTATCTTTGCAC^ 

^5 AGACATCGATACGGCATATGATTACTTGGATAAACGTTTTAATTCTAAAGCACTTCG 

TTATTATTTA TCAA TTGGG ACGT ATGTCTATCATTATGTACCrrCCCATCAGCTGGTTTATCAGTAT^ 
GACATCAAT ATTT TGATTATTTTGATGGGTGTAGTTGCAATTGTTTATTCTTATACT 
ATGGACA GACT TTATTCAAGGTGTGATTCTGATTAGTGGTGTCGTTTTAGCTTTATT^ 
AAGGTGGCmGGTGCAGTAGCAGAAACATTAGCAAACGGGA 

OU AACITGCTTTCAAACTCCATCTTTTTAATTGTGATGGGTTCAGGCTTTAC^ 
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AGAtTTGGrrCAACGTTTTACTACAACACAAAATATTAAGAAACTTAATAAGATG^ 
CACTTGCAACTGCy^CAGTCTTTTACTTGATTGGTACAGGCTTGTACGTACT 
GCAGCTAGCAATATCCCTCAAGACCAAATCTTTATGTACTTTATTGCATACCAGTTA^ 
GATCTTGGCAGCGATTTATGCAGCATCTCAATCAACTATTTCAACAGGTTTGAACT 
5 TGGATATTCAAGATGTCATTTCTAAAAATATGTCZAGACAATCGTCGTACGAAAATTGCACAATTCGT^ 
GTAGGTTTATTCTCAATTGGTGTTTCCATTGTCATGGCTCACTCAGATATTA^ 
TTTCATGGGACTTGTACTTGGTCTACTTGGTGGTGTATTTATTCTTGGA 
GTGCTTATGCAGCGCTGATTGTATCAACCATCGTCATGGTATTTATTAAATACTTCCTTCCT 
TACTGGGCATATTCATTGATTTCAATCTCTGTATCAGTAGTTTCAGGTTATATTG^ 
10 AGTATCTGCACCTAAATATACAACGATTCATGATATTACAGAAATTAAAGCGGATTCAAGTTGGGAAGT^ 
AA 

MGTTGFT'IIDLIILIVYLIJVVLVAGIYFSKKEMKGKEFFKGDGSVPWYVTSVSIFATMLSPISFLGIAG 
LWFAOL^MWAXPLTIRFILPIFARIDIDTAYDyLDKRFNSKALRIISALLFIIYQLGRMSIIMYLPSAGLSVLTGI 
15 DINILII1^GWAIVYSYTGGLKSVLWTDFIQGVILISGWIALF\A-IANIKGGFGAVAETLANG^ 
tn.LSNSIFLIVMGSGFTILSSyASSODLVQRFTrrONIKKl^KMLFTNGVLSIATATVFYLIGTC 
AASNIPQDQIFMYFIAYQLPVGITGLIU^IYAASOSTISTGLNSVATSWTIJJIQDVISKNMSDNRRTKIAQFVSIA 
VGIJS I GVS I VMAHSD I KS A YEWFNSFMGLVI^LI^VFII^FVS KKANKQGAYAALI VST I 
YWAYSLISISVSWSGYIVSVLTGNKVSAPKYTTIHDITEIKADSSWEVRHZ 

20 

ID311 

ATGAAAATTAATAAAAAATATCTAGCJ^^GGTTCAGTGGCAGTCCTTGCCCTAAGTGTTTGTTCCTA 

TCACCAAGCTGGTCAGGATAAGAAAGAGTCTAATCGAGTTGCTTATATAGATGGTGATCAGGCTGGTCAAAAGGCAG 

AAAACTTGACACCAGATGAAGTCAGTAAGAGGGAGGGGATCAACGCCGAACAAATCGTCATCAAGATTACTC 

25 GGTTATGTGACCTCrCATGGAGACCATTATCATTACTATAATGGCAAGGTCCCITATGATGCCATCAT 
GCrCCTCATGAAAGATCCGAATTATCAGTTGAAGGATTCAGACATTGTCAATGAAATCAAGGGTGGTTA 
AGGTAGACGGAAAATACTATGTTTACCTTAAGGATGCAGCTCATGCGGATAATATTCGGACAAAAGAAGAGATTAAA 
CGTCAGAAGCAGGAACGCAGTCATAATCACGGGTCAGGAGCTAACGATCATGCAGTAGCTGCAGCCAGAGC 
ACGCTATACAACGGATGATGGGTATATCTTCAATGCATCTGATATCATTGAGGACACGGGTGATGCTTATA 

30 CTCACGGCGACCATTACCATTACATTCCTAAGAATGAGTTATCAGCTAGCGAGTTAGCTGCTGCAGAAGCCTATTGG 
AATGGGAAGCAGGGATCTCGTCCTTCTTCAAGTTCTAGTTATAATGCAAATCCAGCTC^ 
CCACAATCTGACTGTCACTCCAACTTATCATCAAAATCAAGGGGAAAACATCT 

CTAAACCCTTATCAGAACGCCATGTGGAATCTGATGGCCTTATTTTCGACCCAGCGCAAATCACAAGTCGAACCGCC 
AGAGGTGTAGCTGTCCCTCATGGTAACCATTACCA C TTTATCCCTTATGAACAAATGTCTGAATTGGAAAAACGAAT 

35 TGCTCGTATTATTCCCCTTCGTTATCGTTCAAACCATTGGGTACCAGATTCAAGACCAGAACAACCAAGTCCAC 
CGACTCCGGAACCTAGTCCAAGTCCGCAACCTGCACCAAATCCrCAACCAGCTCCAAGCAATCCAAT^ 
TTGGTCAAAGAAGCTGTTCGAAAAGTAGGCGATGGTTATGTCTTTGAGGAGAATGGAGTTTCTCGTTATATCCCAGC 
CAAGGATCTTTCAGCAGAAACAGCAGCAGGCArrGATAGCAAACTGGCCAAGCAGGAAAGTTTATCTC^ 
GAGCTAAGAAAACTGACCTCCCATCTAGTGATCGAGAATTTTACAATAAGGCTTATGACTTACTAGCAAG^ 

40 CAAGA7TTACTTGATAATAAAGGTCGACAAGTTGATTTTGAGGCTTTGGATAACCTGTTGGAACGACTCAAGGATGT 
CCCAAGTGATAAAGTCAAGTTAGTGGATGATA7T C TTGCCTTCTTAGCTCCGATTCGTCATCCAGAACG7TTAGGAA 
AACCAAATGCGCAAATTACCTACACTGATGATGAGATTCAAGTAGCCAAGTTGGCAGGCAAGTACACAACAGAAGAC 
GGTTATATCTTTGATCCTCGTGATATAACCAGTGATGAGGGGGATGCCTATGTAACTCCACATATGACCCATAGCCA 
CTGGATTAAAAAAGATAGTTTGTCTGAAGCTGAGAGAGCGGCAGCCCAGGCTTATGCTAAAGAGAAAGGTTTGACCC 

45 CTCCTTCGACAGACCATCAGGATTCAGGAAATACTGAGGCAAAAGGAGCAGAAGCTATCTACAACCGCGTGAAAGCA 
GCTAAGAAGGTGCCACTTGATCGTATGCCTTACAATCTTCAATATACTGTAGAAGTCAAAAACGGTAGTTT^ 
ACCTCATTATGACCATTACCATAACATCAAATTTGAGTGGTTTGACGAAGGCCTTTATGAGGCACCT^ 
CTCTTGAGGATCrTlTGGCGACTGTCAAGTACTATGTCGAACATCCAAACGAACGTCCGCATTCAGATAATGG l I X T 
GGTAACGCTAGCGACCATGTTCAAAGAAACAAAAATGGTCAAGCTGATACCAATCAAACGGAAAAACC^ 

50 GAAACCTCJ^GACAGAAAAACCTGAGGAAGAAACCCCTCGAGAAGAGAAACCGCAAAGCGAGAAACCAGAGTCTCCA^ 
AACCAACAGAGGAACCAGAAGAATCACCAGAGGAATCAGAAGiLACCTCAGGTCGAGACTGAAAAGGTTG^ 
CTGAGAGAGGCTGAAGATTTACTTGGAAAAATCCAGGATCCAATTATCAAGTCCAATGCCAAAGAGACTCTCACAGG 
ATTAAAAAATAATTTACTATTTGGCACCCAGGACAACAATACTATTATGGCAGAAGCTGAAAAACT 
TAAAGGAGAGTAAGTAA 

55 

MKINKKYI^GSVAVIJa.SVCSYEI^RHOAGQDICKESNRVAYIDGDOAGQKAENLTPDEVSKREGINA£OIVIKITDQ 
GYVTSHGDHYHYYNGKVPYDAIISEELLMKDP^ry'QLKDSDIVNEIKGGYVIKVDGKYYVYLKID 
RQ KOERSHraGS GANDHAVAAMtAOGRYTTDIXSY I FNASD 1 1 EDTGDAY I VPHGDHYHYI P KIJELS AS EIJ^^ 
NGKOGSRPSSSSSYNANPAOPRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTA 
60 RGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHVA/PDSRPEQPSPQSTPEPSPSPQPAPNPQPAPSNPXDEK 
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LVKEAVRKVGMYVreENGVSRYIPAKDLSAETAAGIDSKIAKOESLSHKI/SAKKTDLPSSDREF^^ 
QDLLDNKGRQVDFEAl-DNLLERLKDVPSD KVKLVDD I LAFLAP I RHP ERLG KPNAQ I TYTDDE I QVAKIAGICnTED 
GYIFDPRDITSDEGDAYVTPHNrniSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGirrEAK^ 
AKKVPUDRMPyNt^YTVEVKNGSLIIPHYDHYHNIKFEWroEGLYEAPKGyTI^ 
5 GNASDHVQRNKNGOADTNQTEKPSEEKPOTEKPEEETPREEKPOSEKPESPKPTEEPEESPEESEEPQVETEKVEEK 
IJiEAEDI^GKIQDPIIKSNAKETLTGLKNNI-LFGTaDNNTIMAEAEKLLALLKE 

ID312 

ATGGAGGGATTGGTTAGAGTGCATTTATTGCCTGTATTTGGCGATTACAAGCTATCTAAACT^ 
1 0 TCAACAGCAAGTAAACAAATGGGCTGACAAGGCyL^^TAAAGGCGAAAAA^^ 

ATAACATGAATAAGCGTATTTTGAAATATGGCGTAGCTATCCAGGTAATACAATACAACCCAGCTAAT^ 

GTTCCACGCAAACAGCAAAAAGAAAAGGCTGCrrGTCAAATACT^ 

TTTAGATGCTCTGGATCAATCAAATTATGAGAACTTATTTGATGTTGT^ 

GCCGTATTAGTGAGGCTCTGGCTCTTGAATGGTCTGATATTGACCTAGAAAGCGGTGTTATCAGCATCAATAAC^ 
CTAAACCGCTATCAGGAAATAAACTCACCTAAATC^ 

ACTTTTACTGAAACAATACAAAAACCGTCAACAAATTCAGTCTTGGAAATTAGGCCG 
CTGTATTTACGGAGAAATATGCTTATGCTTGTAACTTACGCAAACGCCTAAATAAGCATT^ 
ACTAACGTATCATTTCATGGTTTCCGCCATACACATACTACrATGATGCTCTATGCTCAGGI^^ 
TCAGTATAGATTAGGCCACTCTAATrTA^TGATOVCTGAAAATACTTACrGGCATACTAACC^ 
20 AAGCCGTCTCAAATTATGAAACAGCTATCAACAATTTATAA 

MEGLVRVHLLP VFGD YKLS KLTTP I LQQQVNKWADKANKGE KGAFANYS LLHNN^ 
VPRK00KEKAAVICYUDNKELKQFLDYU5AlXOShnfENLFDWLYKTLIATGCRISEAI^ 
L^^lYOEINSPKSSAGYRI^IPIDKATLI^KOYKNRQQIQSWKLGRSETWFSVFTEKYAYACNLRK^ 
25 T^^VSFHGFRHTHTTmLYA0VSPKDV0yRLGHS^^IJ4ITE^^^YWHTN0ENAKKAVSNYETAI^^ 



